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ABSTRACT 


The invention describes a new method to sequence DNA. 
The improvements over the existing DNA sequencing tech- 
nologies are high speed, high throughput, no electrophoresis 
and gel reading artifacts due to the complete absence of an 
electrophoretic step, and no costly reagents involving vari- 
ous substitutions with stable isotopes. The Invention utilizes 
the Sanger sequencing strategy and assembles the sequence 
information by analysis of the nested fragments obtained by 
base-specific chain termination via their different molecular 
masses using mass spectrometry, as for example, MALDI or 
ES mass spectrometry. A further increase in throughput can 
be obtained by introducing mass-modifications in the oli- 
gonucleotide primer, chain-terminating nucleoside triphos- 
phates and/or in the chain-elongating nucleoside triphos- 
phates, as well as using integrated tag sequences which 
allow multiplexing by hybridization of tag specific probes 
with mass differentiated molecular weights. 

42 Claims, 36 Drawing Sheets 
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FIG. I6A 
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DNA SEQUENCING BY MASS 
SPECTROMETRY 

RELATED APPLICATIONS 

5 

This application is a continuation-in-part of U.S. appli- 
cation Ser. No. 08/001,323 filed Jan. 7, 1993, which is now 
abandoned, but which the contents of which are incorpo- 
rated herein by reference. 

10 

BACKGROUND OF THE INVENTION 

Since the genetic information is represented by the 
sequence of the four DNA building blocks deoxyadenosine- 
(dpA), deoxyguanosine-(dpG), deoxycytidine-(dpC) and 
deoxy thy midine-5 '-phosphate (dpT), DNA sequencing is 15 
one of the most fundamental technologies in molecular 
biology and the life sciences in general. The ease and the rate 
by which DNA sequences can be obtained greatly affects 
related technologies such as development and production of 
new therapeutic agents and new and useful varieties of 20 
plants and microorganisms via recombinant DNA technol- 
ogy. In particular, unraveling the DNA sequence helps in 
understanding human pathological conditions including 
genetic disorders, cancer and AIDS. In some cases, very 
subtle differences such as a one nucleotide deletion, addition 25 
or substitution can create serious, in some cases even fatal, 
consequences. Recently, DNA sequencing has become the 
core technology of the Human Genome Sequencing Project 
(e.g., J. E. Bishop and M. Waldholz, 1991, Genome; The 
Story of the Most Astonishing Scientific Adventure of Our 30 
Time — The Attempt to Map All the Genes in the Human 
Body, Simon & Schuster, New York). Knowledge of the 
complete human genome DNA sequence will certainly help 
to understand, to diagnose, to prevent and to treat human 
diseases. To be able to tackle successfully the determination 35 
of the approximately 3 billion base pairs of the human 
genome in a reasonable time frame and in an economical 
way, rapid, reliable, sensitive and inexpensive methods need 
to be developed, which also offer the possibility of automa- 
tion. The present invention provides such a technology. 40 

Recent reviews of today's methods together with future 
directions and trends are given by Barrel! (The FASEB 
Journal 5, 40-45 (1991)), and Trainor {Anal. Chem. 62, 
418-26 (1990)). 45 

Currently, DNA sequencing is performed by either the 
chemical degradation method of Maxam and Gilbert (Meth- 
ods in Enzymology 65, 499-560 (1980)) or the enzymatic 
dideoxynucleotide termination method of Sanger et al. 
(Proc. Natl Acad. Sci. U.S.A. 74, 5463-67 (1977)). In the 50 
chemical method, base specific modifications result in a base 
specific cleavage of the radioactive or fluorescently labeled 
DNA fragment. With the four separate base specific cleavage 
reactions, four sets of nested fragments are produced which 
are separated according to length by polyacrylamide gel 55 
electrophoresis (PAGE). After autoradiography, the 
sequence can be read directly since each band (fragment) in 
the gel originates from a base specific cleavage event. Thus, 
the fragment lengths in the four "ladders" directly translate 
into a specific position in the DNA sequence. gQ 

In the enzymatic chain termination method, the four base 
specific sets of DNA fragments are formed by starting with 
a primer/template system elongating the primer into the 
unknown DNA sequence area and thereby copying the 
template and synthesizing a complementary strand by DNA 65 
polymerases, such as Klenow fragment of E. coli DNA 
polymerase I, a DNA polymerase from Thermus aquaticus, 


Taq DNA polymerase, or a modified T7 DNA polymerase, 
Sequenase (Tabor et al., Proc. Natl. Acad. Sci. U.S.A. 84, 
4767—4771 (1987)), in the presence of chain-terminating 
reagents. Here, the chain-terminating event is achieved by 
incorporating into the four separate reaction mixtures in 
addition to the four normal deoxynucleoside triphosphates, 
dATP, dGTP, dTTP and dCTP, only one of the chain- 
terminatin g d ideoxynucleoside triphosphates, ddATP, 
ddGTP, ddTTP or ddCTP, respectively, in a limiting small 
concentration. The four sets of resulting fragments produce, 
after electrophoresis, four base specific ladders from which 
the DNA sequence can be determined. 

A recent modification of the Sanger sequencing strategy 
involves the degradation of phosphorothioate-containing 
DNA fragments obtained by using alpha-thio dNTP instead 
of the normally used ddNTPs during the primer extension 
reaction mediated by DNA polymerase (Labeit et al., DNA 
5, 173-177 (1986); Amersham, PCT- Application GB86/ 
00349; Eckstein et al., Nucleic Acids Res. 16, 9947 (1988)). 
Here, the four sets of base- specific sequencing ladders are 
obtained by limited digestion with exonuclease III or snake 
venom phosphodiesterase, subsequent separation on PAGE 
and visualization by radioisotopic labeling of either the 
primer or one of the dNTPs. In a further modification, the 
base- specific cleavage is achieved by alkylating the sulphur 
atom in the modified phosphodiester bond followed by a 
heat treatment (Max-Planck-Gesellschaft, DE 3930312 Al). 
Both methods can be combined with the amplification of the 
DNA via the Polymerase Chain Reaction (PCR). 

On the upfront end, the DNA to be sequenced has to be 
fragmented into sequencable pieces of currently not more 
than 500 to 1000 nucleotides. Starting from a genome, this 
is a multi-step process involving cloning and subcloning 
steps using different and appropriate cloning vectors such as 
YAC, cosmids, plasmids and M13 vectors (Sambrook et al., 
Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory Press, 1989). Finally, for Sanger 
sequencing, the fragments of about 500 to 1000 base pairs 
are integrated into a specific restriction site of the replicative 
form I (RF I) of a derivative of the Ml 3 bacteriophage 
(Vieria and Messing, Gene 19, 259 (1982)) and then the 
double-stranded form is transformed to the single-stranded 
circular form to serve as a template for the Sanger sequenc- 
ing process having a binding site for a universal primer 
obtained by chemical DNA synthesis (Sinha, Biernat, 
McManus and Koster, Nucleic Acids Res. 12, 4539-57 
(1984); U.S. Pat. No. 4,725,677 upstream of the restriction 
site into which the unknown DNA fragment has been 
inserted. Under specific conditions, unknown DNA 
sequences integrated into supercoiled double-stranded plas- 
mid DNA can be sequenced directly by the Sanger method 
(Chen and Seeburg, DNA 4, 1 65-170 (1985)) and Lim et al., 
Gene Anal. Techn. 5, 32-39 (1988), and, with the Poly- 
merase Chain Reaction (PCR) (PCR Protocols: A Guide to 
Methods and Applications, Innis et al., editors, Academic 
Press, San Diego (1990)) cloning or subcloning steps could 
be omitted by directly sequencing off chromosomal DNA by 
first amphTying the DNA segment by PCR and then applying 
the Sanger sequencing method (Innis et al., Proc. Natl. Acad. 
Sci. U.S.A. 85, 9436-9440 (1988)). In this case, however, the 
DNA sequence in the interested region most be known at 
least to the extent to bind a sequencing primer. 

In order to be able to read the sequence from PAGE, 
detectable labels have to be used in either the primer (very 
often at the 5 '-end) or in one of the deoxynucleoside 
triphosphates, dNTP. Using radioisotopes such as 32 P, 33 P, or 
35 S is still the most frequently used technique. After PAGE, 
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the gels are exposed to X-ray films and silver grain exposure 
is analyzed. The use of radioisotopic labeling creates several 
problems. Most labels useful for autoradiographic detection 
of sequencing fragments have relatively short half-lives 
which can limit the useful time of the labels. The emission 
high energy beta radiation, particularly from 32 P, can lead to 
breakdown of the products via radiolysis so that the sample 
should be used very quickly after labeling. In addition, high 
energy radiation can also cause a deterioration of band 
sharpness by scattering. Some of these problems can be 
reduced by using the less energetic isotopes such as 33 P or 
3S S (see, e.g., Ornstein et al., Biotechniques 3, 476 (1985)). 
Here, however, longer exposure times have to be tolerated. 
Above all, the use of radioisotopes poses significant health 
risks to the experimentalist and, in heavy sequencing 
projects, decontamination and handling the radioactive 
waste are other severe problems and burdens. 

In response to the above mentioned problems related to 
the use of radioactive labels, non-radioactive labeling tech- 
niques have been explored and, in recent years, integrated 
into partly automated DNA sequencing procedures. All these 
improvements utilize the Sanger sequencing strategy. The 
fluorescent label can be tagged to the primer (Smith et al., 
Nature 321, 674-679 (1986) and EPO Patent No. 
87300998.9; Du Pont De Nemours EPO Application No. 
0359225; Ansorge et al. 7. Biochem. Biophys. Methods 13, 
325-32 (1986)) or to the chain- terminating dideoxynuclo- 
side triphosphates (Prober et al. Science 238, 336-41 (1987); 
Applied Biosy stems, PCT Application WO 91/05060). 
Based on either labeling the primer or the ddNTP, systems 
have been developed by Applied Biosystems (Smith et al, 
Science 235, G89 (1987); U.S. Pat. Nos. 570,973 and 
689,013), Du Pont De Nemours (Prober et al., Science 238, 
336-341 (1987); U.S. Pat. Nos. 881,372 and 57,566), Phar- 
macia-LKB (Ansorge et al. Nucleic Acids Res. 15, 
4593-4602 (1987) and EMBL Patent Application DE 
P3724442 and P3 805808.1) and Hitachi (JP 1 -90844 and DE 
4011991 Al). A somewhat similar approach was developed 
by Brumbaugh et al. (Proc. Natl. Set U.S.A. 85, 5610-14 
(1988) and U.S. Pat. No. 4,729,947). An improved method 
for the Du Pont system using two electrophoretic lanes with 
two different specific labels per lane is described (PCT 
Application WO92/02635). A different approach uses fluo- 
rescently labeled avidin andbiotin labeled primers. Here, the 
sequencing ladders ending with biotin are reacted during 
electrophoresis with the labeled avidin which results in the 45 
detection of the individual sequencing bands (Brumbaugh et 
al, U.S. Pat. No. 594,676). 

More recently even more sensitive non-radioactive label- 
ing techniques for DNA using chemiluminescence trigger- 5Q 
able and amplifyable by enzymes have been developed 
(Beck, O'Keefe, Coull and Koster, Nucleic Acids Res. 17, 
5115-5123 (1989) and Beck and Kster, Anal. Chem. 62, 
2258-2270 (1990)). These labeling methods were combined 
with multiplex DNA sequencing (Church et al. Science 240, 55 
185-188 (1988) to provide for a strategy aimed at high 
throughput DNA sequencing (Koster et al., Nucleic Acids 
Res. Symposium Ser. No. 24, 318-321 (1991), University of 
Utah, PCT Application No. WO 90/15883); this strategy still 
suffers from the disadvantage of being very laborious and ^ 
difficult to automate. 

In an attempt to simplify DNA sequencing, solid supports 
have been introduced. In most cases published so far, the 
template strand for sequencing (with or without PCR ampli- 
fication) is immobilized on a solid support most frequently 65 
utilizing the strong biotin-avidin/streptavidin interaction 
(Orion- Yhtyma Oy, U.S. Pat. No. 277,643; M. Uhlen et al. 


40 


Nucleic Acids Res. 16, 3025-38 (1988); Cemu Bioteknik, 
PCT Application No. WO 89/09282 and Medical Research 
Council, GB, PCT Application No. WO 92/03575). The 
primer extension products synthesized on the immobilized 
template strand are purified of enzymes, other sequencing 
reagents and by-products by a washing step and then 
released under denaturing conditions by loosing the hydro- 
gen bonds between the Watson-Crick base pairs and sub- 
jected to PAGE separation. In a different approach, the 
primer extension products (not the template) from a DNA 
sequencing reaction are bound to a solid support via biotin/ 
avidin (Du Pont De Nemours, PCT Application WO 
91/11533). In contrast to the above mentioned methods, 
here, the interaction between biotin and avidin is overcome 
by employing denaturing conditions (formamide/EDTA) to 
release the primer extension products of the sequencing 
reaction from the solid support for PAGE separation. As 
solid supports, beads, (e.g., magnetic beads (Dynabeads) 
and Sepharose beads), filters, capillaries, plastic dipsticks 
(e.g., polystyrene strips) and microtiter wells are being 
proposed. 

All methods discussed so far have one central step in 
common: polyacrylamide gel electrophoresis (PAGE). In 
many instances, this represents a major drawback and limi- 
tation for each of these methods, freparing a homogeneous 
gel by polymerization, loading of the samples, the electro- 
phoresis itself, detection of the sequence pattern (e.g., by 
autoradiography), removing the gel and cleaning the glass 
plates to prepare another gel are very laborious and time- 
consuming procedures. Moreover, the whole process is 
error-prone, difficult to automate, and, in order to improve 
reproducibility and reliability, highly trained and skilled 
personnel are required. In the case of radioactive labeling, 
autoradiography itself can consume from hours to days. In 
the case of fluorescent labeling, at least the detection of the 
sequencing bands is being performed automatically when 
using the laser-scanning devices integrated into commercial 
available DNA sequencers. One problem related to the 
fluorescent labeling is the influence of the four different 
base-specific fluorescent tags on the mobility of the frag- 
ments during electrophoresis and a possible overlap in the 
spectral bandwidth of the four specific dyes reducing the 
discriminating power between neighboring bands, hence, 
increasing the probability of sequence ambiguities. Artifacts 
are also produced by base-specific interactions with the 
polyacrylamide gel matrix (Frank and Koster, Nucleic Acids 
Res. 6, 2069 (1979)) and by the formation of secondary 
structures which result in "band compressions" and hence do 
not allow one to read the sequence. This problem has, in 
part, been overcome by using 7-deazadeoxyguanosine triph- 
osphates (Barr et al., Biotechniques 4, 428 (1986)). How- 
ever, the reasons for some artifacts and conspicuous bands 
are still under investigation and need further improvement of 
the gel electrophoretic procedure. 

A recent innovation in electrophoresis is capillary zone 
electrophoresis (CZE) (Jorgenson et al., J. Chromatography 
352, 337 (1986); Gesteland et al., Nucleic Acids Res. 18, 
1415-1419 (1990)) which, compared to slab gel electro- 
phoresis (PAGE), significantly increases the resolution of 
the separation, reduces the time for an electrophoretic run 
and allows the analysis of very small samples. Here, how- 
ever, other problems arise due to the miniaturization of the 
whole system such as wall effects and the necessity of highly 
sensitive on-line detection methods. Compared to PAGE, 
another drawback is created by the fact that CZE is only a 
"one-lane" process, whereas in PAGE samples in multiple 
lanes can be electrophoresed simultaneously. 
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Due to the severe limitations and problems related to 
having PAGE as an integral and central part in the standard 
DNA sequencing protocol, several methods have been pro- 
posed to do DNA sequencing without an electrophoretic 
step. One approach calls for hybridization or fragmentation 5 
sequencing (Bains, Biotechnology 10, 757-58 (1992) and 
Mirzabekov et al., FEBS Utters 256, 118-122 (1989)) 
utilizing the specific hybridization of known short oligo- 
nucleotides (e.g., octadeoxynucleotides which gives 65,536 
different sequences) to a complementary DNA sequence. 10 
Positive hybridization reveals a short stretch of the unknown 
sequence- Repeating this process by performing hybridiza- 
tions with all possible octadeoxynucleotides should theo- 
retically determine the sequence. In a completely different 
approach, rapid sequencing of DNA is done by unilaterally 15 
degrading one single, immobilized DNA fragment by an 
exonuclease in a moving flow stream and detecting the 
cleaved nucleotides by their specific fluorescent tag via laser 
excitation (Jett et al., J. Biomolecular Structure & Dynamics 
7, 301-309, (1989); United States Department of Energy, 2Q 
PCT Application No. WO 89/03432). In another system 
proposed by Hyman (Anal. Biochem. 174, 423-^36 (1988)), 
the pyrophosphate generated when the correct nucleotide is 
attached to the growing chain on a primer-template system 
is used to determine the DNA sequence. The enzymes used 25 
and the DNA are held in place by solid phases (DEAE- 
Sepharose and Sepharose) either by ionic interactions or by 
covalent attachment. In a continuous flow-through system, 
the amount of pyrophosphate is determined via biolumines- 
cence (luciferase). A synthesis approach to DNA sequencing 30 
is also used by Tsien et al. (PCT Application No. WO 
91/06678). Here, the incoming dNTP's are protected at the 
3-end by various blocking groups such as acetyl or phos- 
phate groups and are removed before the next elongation 
step, which makes this process very slow compared to 35 
standard sequencing methods. The template DNA is immo- 
bilized on a polymer support. To detect incorporation, a 
fluorescent or radioactive label is additionally incorporated 
into the modified dNTP's. The same patent application also 
describes an apparatus designed to automate the process. ^ 

Mass spectrometry, in general, provides a means of 
"weighing" individual molecules by ionizing the molecules 
in vacuo and making them "fly" by volatilization. Under the 
influence of combinations of electric and magnetic fields, 
the ions follow trajectories depending on their individual 45 
mass (m) and charge (z). In the range of molecules with low 
molecular weight, mass spectrometry has long been part of 
the routine physical-organic repertoire for analysis and 
characterization of organic molecules by the determination 
of the mass of the parent molecular ion. In addition, by 50 
arranging collisions of this parent molecular ion with other 
particles (e.g., argon atoms), the molecular ion is fragmented 
forming secondary ions by the so-called collision induced 
dissociation (CID). The fragmentation pattern/pathway very 
often allows the derivation of detailed structural informa- 55 
tion. Many applications of mass spectrometric methods in 
the known in the art, particularly in biosciences, and can be 
found summarized in Methods in Enzymology, Vol. 193: 
"Mass Spectrometry" (J. A. McCloskey, editor), 1990, Aca- 
demic Press, New York. 60 

Due to the apparent analytical advantages of mass spec- 
trometry in providing high detection sensitivity, accuracy of 
mass measurements, detailed structural information by CID 
in conjunction with an MS/MS configuration and speed, as 
well as on-line data transfer to a computer, there has been 65 
considerable interest in the use of mass spectrometry tier the 
structural analysis of nucleic acids. Recent reviews summa- 


rizing this field include K. H. Schram, "Mass Spectrometry 
of Nucleic Acid Components, Biomedical Applications of 
Mass Spectrometry" 34, 203-287 (1990); and P. F. Crain, 
"Mass Spectrometric Techniques in Nucleic Acid 
Research," Mass Spectrometry Reviews 9, 505-554 (1990). 
The biggest hurdle to applying mass spectrometry to nucleic 
acids is the difficulty of volatilizing these very polar biopoly- 
mers. Therefore, "sequencing" has been limited to low 
molecular weight synthetic oligonucleotides by deterrmning 
the mass of the parent molecular ion and through this, 
confirming the already known sequence, or alternatively, 
confirming the known sequence through the generation of 
secondary ions (fragment ions) via CID in an MS/MS 
configuration utilizing, in particular, for the ionization and 
volatilization, the method of fast atomic bombardment (FAB 
mass spectrometry) or plasma desorption (PD mass spec- 
trometry). As an example, the application of FAB to the 
analysis of protected dimeric blocks for chemical synthesis 
of oligodeoxynucleotides has been described (Koster et al. 
Biomedical Environmental Mass Spectrometry 14, 111-116 
(1987)). 

Two more recent ionization/desorption techniques are 
electrospray/ionspray (ES) and matrix-assisted laser desorp- 
tion/ionization (MALDI). ES mass spectrometry has been 
introduced by Fenn et al. (/. Phys. Chem. 88, 4451-59 
(1984); PCT Application No. WO 90/14148) and current 
applications are summarized in recent review articles (R. D. 
Smith et al., Anal. Chem. 62, 882-89 (1990) and B. Ardrey, 
Electrospray Mass Spectrometry, Spectroscopy Europe, 4, 
10-18 (1992)). The molecular weights of the tetradecanucle- 
otide d(CATGCCATGGCATG) (SEQ ID NO:l) (Covey et 
al. "The Determination of Protein, Oligonucleotide and 
Peptide Molecular Weights by lonspray Mass Spectrom- 
etry," Rapid Communications in Mass Spectrometry, 2, 
249-256 (1988)), of the 21-mer d(AAATTGTGCACATC- 
CTGCAGC) (SEQ ID NO:2) and without giving details of 
that of a tRN A with 76 nucleotides (Methods in Enzymology, 
193, "Mass Spectrometry" (McCloskey, editor), p. 425, 
1990, Academic Press, New York) have been published. As 
a mass analyzer, a quadrupole is most frequently used. The 
determination of molecular weights in femtomole amounts 
of sample is very accurate due to the presence of multiple 
ion peaks which all could be used for the mass calculation. 

MALDI mass spectrometry, in contrast, can be particu- 
larly attractive when a time-of-flight (TOF) configuration is 
used as a mass analyzer. The MALDI-TOF mass spectrom- 
etry has been introduced by Hillenkamp et al. ("Matrix 
Assisted UV-Laser Desorption/Ionization: A New Approach 
to Mass Spectrometry of Large Biomoleeules," Biological 
Mass Spectrometry (Burlingame and McCloskey, editors), 
Elsevier Science Publishers, Amsterdam, pp. 49-60, 1990.) 
Since, in most cases, no multiple molecular ion peaks are 
produced with this technique, the mass spectra, in principle, 
look simpler compared to ES mass spectrometry. Although 
DNA molecules up to a molecular weight of 410,000 daltons 
could be desorbed and volatilized (Williams et al., "Vola- 
tilization of High Molecular Weight DNA by Pulsed Laser 
Ablation of Frozen Aqueous Solutions," Science, 246, 
1585-87 (1989)), this technique has so far only been used to 
deterrnine the molecular weights of relatively small oligo- 
nucleotides of known sequence, e.g., oligothymidylic acids 
up to 18 nucleotides (Huth-Fehre et al., "Matrix- Assisted 
Laser Desorption Mass Spectrometry of Oligodeox- 
ythymidylic Acids," Rapid Communications in Mass Spec- 
trometry, 6, 209-13 (1992)) and a double-stranded DNA of 
28 base pairs (Williams et al., "Time-of-Flight Mass Spec- 
trometry of Nucleic Acids by Laser Ablation and Ionization 
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from a Frozen Aqueous Matrix," Rapid Communications in 
Mass Spectrometry, 4, 348-351 (1990)). In one publication 
(Huth-Fehre et ah, 1992, supra), it was shown that a mixture 
of all the oligothymidylic acids from n=12 to n=18 nucle- 
otides could be resolved. 

In U.S. Pat. No. 5,064,754, RNA transcripts extended by 
DNA both of which are complementary to the DNA to be 
sequenced are prepared by incorporating NTP's, dNTP's 
and, as terminating nucleotides, ddNTP's which are substi- 
tuted at the 5 '-position of the sugar moiety with one or a 
combination of the isotopes 12 C, r3 C, 14 C, r H, 2 H, 3 H, ls O, 
17 0 and lg O. The polynucleotides obtained are degraded to 
3 '-nucleotides, cleaved at the N-glycosidic linkage and the 
isotopically labeled 5 '-functionality removed by periodate 
oxidation and the resulting formaldehyde species deter- 
mined by mass spectrometry. A specific combination of 
isotopes serves to discriminate base-specifically between 
internal nucleotides originating from the incorporation of 
NTP's and dNTP's and terminal nucleotides caused by 
linking ddNTP's to the end of the polynucleotide chain. A 
series of RNA/DNA fragments is produced, and in one 20 
embodiment, separated by electrophoresis, and, with the aid 
of the so-called matrix method of analysis, the sequence is 
deduced. 

In Japanese Patent No. 59-131909, an instrument is 
described which detects nucleic acid fragments separated 25 
either by electrophoresis, liquid chromatography or high 
speed gel filtration. Mass spectrometric detection is 
achieved by incorporating into the nucleic acids atoms 
which normally do not occur in DNA such as S, Br, I or Ag, 
Au, Pt, Os, Hg. The method, however, is not applied to 30 
sequencing of DNA using the Sanger method. In particular, 
it does not propose a base-specific correlation of such 
elements to an individual ddNTP. 

PCT Application No. WO 89/12694 (Brennan et al., Proc. 
SPIE-Int. Soc. Opt. Eng. 1206, {New Technol Cytom. Mol 35 
Biol), pp. 60-77 (1990); and Brennan,- U.S. Pat. No. 5,003, 
059) employs the Sanger methodology for DNA sequencing 
by using a combination of either the four stable isotopes 32 S, 
33 S, 34 S, 36 S or 35 C1, 37 C1, 79 Br, 81 Br to specifically label the 
chain-terminating ddNTP's. The sulfur isotopes can be 40 
located either in the base or at the alpha-position of the 
triphosphate moiety whereas the halogen isotopes are 
located either at the base or at the 3'-position of the sugar 
ring. The sequencing reaction mixtures are separated by an 
electrophoretic technique such as CZE, transferred to a 45 
combustion unit in which the sulfur isotopes of the incor- 
porated ddNTP's are transformed at about 900° C. in an 
oxygen atmosphere. The S0 2 generated with masses of 64, 
65, 66 or 68 is determined on-line by mass spectrometry 
using, e.g., as mass analyzer, a quadrupole with a single SQ 
ion-multiplier to detect the ion current. 

A similar approach is proposed in U.S. Patent No. 5,002, 
868 (Jacobson et al., Proc. SPIE-Int Soc. Opt. Eng. 1435, 
(Opt. Methods Ultrasensitive Detect. Anal. Tech. Appl), 
26-35 (1991)) using Sanger sequencing with four ddNTP's 55 
specifically substituted at the alpha-position of the triphos- 
phate moiety with one of the four stable sulfur isotopes as 
described above and subsequent separation of the four sets 
of nested sequences by tube gel electrophoresis. The only 
difference is the use of resonance ionization spectroscopy 60 
(RIS) in conjunction with a magnetic sector mass analyzer 
as disclosed in U.S. Pat. No. 4,442,354 to detect the sulfur 
isotopes corresponding to the specific nucleotide termina- 
tors, and by this, allowing the assignment of the DNA 
sequence. 65 

EPO Patent Applications No. 0360676 Al and 0360677 
Al also describe Sanger sequencing using stable isotope 
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substitutions in the ddNTP's such as D, 13 C, 1S N, 17 0, ls O, 
32 S, 33 S, 34 S, 3e S, I9 F, 35 C1, 37 C1, 79 Br, 81 Br and 127 I or 
functional groups such as CF 3 or Si(CH 3 ) 3 at the base, the 
sugar or the alpha position of the triphosphate moiety 
according to chemical functionality. The Sanger sequencing 
reaction mixtures are separated by tube gel electrophoresis. 
The effluent is converted into an aerosol by the electrospray/ 
thermospray nebulizer method and then atomized and ion- 
ized by a hot plasma (7000° to 8000° K.) and analyzed by 
a simple mass analyzer. An instrument is proposed which 
enables one to automate the analysis of the Sanger sequenc- 
ing reaction mixture consisting of tube electrophoresis, a 
nebulizer and a mass analyzer. 

The application of mass spectrometry to perform DNA 
sequencing by the hybridization/fragment method (see 
above) has been recently suggested (Bains, "DNA Sequenc- 
ing by Mass Spectrometry: Outline of a Potential Future 
Application," Chimicaoggi 9, 13-16 (1991)). 

SUMMARY OF THE INVENTION 

The invention describes a new method to sequence DNA. 
The improvements over the existing DNA sequencing tech- 
nologies include high speed, high throughput, no required 
electrophoresis (and, thus, no gel reading artifacts due to the 
complete absence of an electrophoretic step), and no costly 
reagents involving various substitutions with stable isotopes. 
The invention utilizes the Sanger sequencing strategy and 
assembles the sequence information by analysis of the 
nested fragments obtained by base-specific chain termina- 
tion via their different molecular masses using mass spec- 
trometry, for example, MALDI or ES mass spectrometry. A 
further increase in throughput can be obtained by introduc- 
ing mass modifications in the oligonucleotide primer, the 
chain-terminating nucleoside triphosphates and/or the chain- 
elongating nucleoside triphosphates, as well as using inte- 
grated tag sequences which allow multiplexing by hybrid- 
ization of tag specific probes with mass differentiated 
molecular weights. 

BRIEF DESCRIPTION OF THE FIGURES 

FIG. 1 is a representation of a process to generate the 
samples to be analyzed by mass spectrometry. This process 
entails insertion of a DNA fragment of unknown sequence 
into a cloning vector such as derivatives of M13, pUC or 
phagemids; transforming the double- stranded form into the 
single-stranded form; performing the four Sanger sequenc- 
ing reactions; linking the base-specifically terminated nested 
fragment family temporarily to a solid support; removing by 
a washing step all by-products; conditioning the nested DNA 
or RNA fragments by, for example, cation-ion exchange or 
modification reagent and presenting the immobilized nested 
fragments either directly to mass spectrometric analysis or 
cleaving the purified fragment family off the support and 
evaporating the cleavage reagent. 

FIG. 2A shows the Sanger sequencing products using 
ddTTP as terminating deoxynucleoside triphosphate of a 
hypothetical DNA fragment of 50 nucleotides (SEQ ID 
NO:3) in length with approximately equally balanced base 
composition. The molecular masses of the various chain 
terminated fragments are given. 

FIG. 2B shows an idealized mass spectrum of such a DNA 
fragment mixture. 

FIGS. 3A and 3B show, in analogy to FIGS. 2A and 2B, 
data for the same model sequence (SEQ ID NO: 3) with 
ddATP as chain terminator. 
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FIGS. 4A and 4B show data, analogous to FIGS. 2A and 
2B when ddGTP is used as a chain terminator for the same 
model sequence (SEQ ID NO: 3). 

FIGS. SA and SB illustrate the results obtained where 
chain termination is performed with ddCTP as a chain 5 
terminator, in a similar way as shown in FIGS. 2A and 2B 
for the same model sequence (SEQ ID NO:3). 

FIG. 6 summarizes the results of FIGS. 2A to 5B, showing 
the correlation of molecular weights of the nested four 
fragment families to the DNA sequence (SEQ ID NO: 3). io 

FIG. 7 illustrates the general structure of mass-modified 
sequencing nucleic acid primers or tag sequencing probes 
for either Sanger DNA or Sanger RNA sequencing. 

FIG. 8 shows the general structure for the mass-modified 
triphosphates for either Sanger DNA or Sanger RNA 15 
sequencing. General formulas of the chain-elongating and 
the chain-terminating nucleoside triphosphates are demon- 
strated. 

FIG. 9 outlines various linking chemistries (X) with either 
polyethylene glycol or terminally monoalkylated polyethyl- 20 
ene glycol (R) as an example. 

FIG. 10 illustrates similar linking chemistries as shown in 
FIG. 8 and depicts various mass modifying moieties (R). 

FIG. 11 outlines how multiplex mass spectrometric ^ 
sequencing can work using the mass-modified nucleic acid 
primer (UP). 

FIG. 12 shows the process of multiplex mass spectromet- 
ric sequencing employing mass-modified chain-elongating 
and/or terminating nucleoside triphosphates. 30 

FIG. 13 shows multiplex mass spectrometric sequencing 
by involving the hybridization of mass-modified tag 
sequence specific probes. 

FIG. 14 shows a MALDI-TOF spectrum of a mixture of 
oligothymidylic acids, d(pT) 12 „ 18 . 35 

FIG. 15 shows a superposition of MALDI-TOF spectra of 
the 50-mer d(TAACGGTCATTACGGCCATTGACTG- 
TAGGACCTGCATTACATGACTAGCT) (SEQ ID NO:3) 
(500 frnol) and dT(pdT) 99 (500 fmol). 

FIG. 16 shows the MALDI-TOF spectra of all 13 DNA 40 
sequences representing the nested dT-terminated fragments 
of the Sanger DNA sequencing simulation of FIG. 2, 500 
fmol each. 

FIG. 17 shows the superposition of the spectra of FIG. 16. 45 
The two panels show two different scales and the spectra 
analyzed at that scale. 

FIG. 18 shows the superimposed MALDI-TOF spectra 
from MALDI-MS analysis of mass-modified oligonucle- 
otides as described in Example 21. 50 

FIG. 19 illustrates various linking chemistries between 
the solid support (P) and the nucleic acid primer (NA) 
through a strong electrostatic interaction. 

FIG. 20 illustrates various linking chemistries between 
the solid support (P) and the nucleic acid primer (NA) 55 
through a charge transfer complex of a charge transfer 
acceptor (A) and a charge transfer donor (D). 

FIG. 21 illustrates various linking chemistries between 
the solid support (P) and the nucleic acid primer (NA) m 
through a stable organic radical. 

FIG. 22 illustrates a possible linking chemistry between 
the solid support (P) and the nucleic acid primer (NA) 
through Watson-Crick base pairing. 

FIG. 23 illustrates linking the solid support (P) and the 65 
nucleic acid primer (NA) through a photolytically cleavable 
bond. 
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DETAILED DESCRIPTION OF THE 
INVENTION 


This invention describes an improved method of sequenc- 
ing DNA. In particular, this invention employs mass spec- 
trometry, such as matrix-assisted laser desorption/ionization 
(MALDI) or electrospray (ES) mass spectrometry (MS), to 
analyze the Sanger sequencing reaction mixtures. 

In Sanger sequencing, four families of cham-terminated 
fragments are obtained. The mass difference per nucleotide 
addition is 289.19 for dpC, 313.21 for dpA, 329.21 for dpG 
and 304.2 for dpT, respectively. 

In one embodiment, through the separate determination of 
the molecular weights of the four base-specifically termi- 
nated fragment families, the DNA sequence can be assigned 
via superposition (e.g., interpolation) of the molecular 
weight peaks of the four individual experiments. In another 
embodiment, the molecular weights of the four specifically 
terminated fragment families can be determined simulta- 
neously by MS, either by mixing the products of all four 
reactions run in at least two separate reaction vessels (i.e., all 
run separately, or two together, or three together) or by 
running one reaction having all four chain-terminating 
nucleotides (e.g., a reaction mixture comprising dTTP, 
ddTTP, dATP, ddATP, dCTP, ddCTP, dGTP, ddGTP) in one 
reaction vessel. By simultaneously analyzing all four base- 
specifically terminated reaction products, the molecular 
weight values have been, in effect, interpolated. Comparison 
of the mass difference measured between fragments with the 
known masses of each chain-terminating nucleotide allows 
the assignment of sequence to be carried out. In some 
instances, it may be desirable to mass modify, as discussed 
below, the chain-terminating nucleotides so as to expand the 
difference in molecular weight between each nucleotide. It 
will be apparent to those skilled in the art when mass- 
modification of the chain-terminating nucleotides is desir- 
able and can depend, for instance, on the resolving ability of 
the particular spectrometer employed. By way of example, 
it may be desirable to produce four chain-terminating nucle- 
otides, ddTTP, ddCTP 1 , ddATP 2 and ddGTP 3 where 
ddCTP 1 , ddATP 2 and ddGTP 3 have each been mass-modi- 
fied so as to have molecular weights resolvable from one 
another by the particular spectrometer being used. 

The terms chain-elongating nucleotides and chain-termi- 
nating nucleotides are well known in the art. For DNA, 
chain-elongating nucleotides include 2-deoxyribonucle- 
otides and chain-terminating nucleotides include 2',3'- 
dideoxyribonucleotides. For RNA, chain-elongating nucle- 
otides include ribonucelotides and chain-terminating 
nucleotides include 3'-deoxyribonucleotides. The term 
nucleotide is also well known in the art. For the purposes of 
this invention, nucleotides include nucleoside mono-, di-, 
and triphosphates. Nucleotides also include modified nucle- 
otides such as phosphorothioate nucleotides. 

Since mass spectrometry is a serial method, in contrast to 
currently used slab gel electrophoresis which allows several 
samples to be processed in parallel, in another embodiment 
of this invention, a further improvement can be achieved by 
multiplex mass spectrometric DNA sequencing to allow 
simultaneous sequencing of more than one DNA or RNA 
fragment. As described in more detail below, the range of 
about 300 mass units between one nucleotide addition can 
be utilized by employing either mass-modified nucleic acid 
sequencing primers or chain-elongating and/or terminating 
nucleoside triphosphates so as to shift the molecular weight 
of the base-specifically terminated fragments of a particular 
DNA or RNA species being sequenced in a predetermined 
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manner. For the first time, several sequencing reactions can 
be mass spectrometrically analyzed in parallel. In yet 
another embodiment of this invention, multiplex mass spec- 
trometric DNA sequencing can be performed by mass modi- 
fying the fragment families through specific oligonucle- 5 
otides (tag probes) which hybridize to specific tag sequences 
within each of the fragment families. In another embodi- 
ment, the tag probe can be covalently attached to the 
individual and specific tag sequence prior to mass spectrom- 

etry ' 10 

In one embodiment of the invention, the molecular weight 
values of at least two base-specifically terminated fragments 
are determined concurrently using mass spectrometry. The 
molecular weight values of preferably at least five and more 
preferably at least ten base-specifically terminated frag- 15 
ments are determined by mass spectrometry. Also included 
in the invention are determinations of the molecular weight 
values of at least 20 base-specifically terminated fragments 
and at least 30 base-specifically terminated fragments. Fur- 
ther, the nested base-specifically terminated fragments in a 2Q 
specific set can be purified of all reactants and by-products 
but are not separated from one another. The entire set of 
nested base-specifically terminated fragments is analyzed 
concurrently and the molecular weight values are deter- 
mined. At least two base-specifically terminated fragments 25 
are analyzed concurrently by mass spectrometry when the 
fragments are contained in the same sample. 

In general, the overall mass spectrometric DNA sequenc- 
ing process will start with a library of small genomic 
fragments obtained after first randomly or specifically cut- 3 o 
ting the genomic DNA into large pieces which then, in 
several subcloning steps, are reduced in size and inserted 
into vectors like derivatives of M13 or pUC (e.g., M13mpl8 
or M13mpl9) (see FIG. 1). In a different approach, the 
fragments inserted in vectors, such as M13, are obtained via 35 
subcloning starting with a cDNA library. In yet another 
approach, the DNA fragments to be sequenced are generated 
by the polymerase chain reaction (e.g., Higuchi et al., "A 
General Method of in vitro Preparation and Mutagenesis of 
DNA Fragments: Study of Protein and DNA Interactions," 40 
Nucleic Acids Res., 16, 7351-67 (1988)). As is known in the 
art, Sanger sequencing can start from one nucleic acid 
primer (UP) binding to the plus-strand or from another 
nucleic acid primer binding to the opposite minus-strand. 
Thus, either the complementary sequence of both strands of 45 
a given unknown DNA sequence can be obtained (providing 
for reduction of ambiguity in the sequence determination) or 
the length of the sequence information obtainable from one 
clone can be extended by generating sequence information 
from both ends of the unknown vector-inserted DNA frag- 50 
ment. 

The nucleic acid primer carries, preferentially at the 
5'-end, a linking functionality, L, which can include a spacer 
of sufficient length and which can interact with a suitable 
functionality, L\ on a solid support to form a reversible 55 
linkage such as a photocleavable bond. Since each of the 
four Sanger sequencing families starts with a nucleic acid 
primer (L-UP; FIG. 1) this fragment family can be bound to 
the solid support by reacting with functional groups, L\ on 
the surface of a solid support and then intensively washed to 60 
remove all buffer salts, triphosphates, enzymes, reaction 
by-products, etc. Furthermore, for mass spectrometric analy- 
sis, it can be of importance at this stage to exchange the 
cation at the phosphate backbone of the DNA fragments in 
order to eliminate peak broadening due to a heterogeneity in 65 
the cations bound per nucleotide unit. Since the L — V 
linkage is only of a temporary nature with the purpose to 
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capture the nested Sanger DNA or RNA fragments to 
properly condition them for mass spectrometric analysis, 
there are different chemistries which can serve this purpose. 
In addition to the examples given in which the nested 
fragments are coupled covalently to the solid support, 
washed, and cleaved off the support for mass spectrometric 
analysis, the temporary linkage can be such that it is cleaved 
under the conditions of mass spectrometry, i.e., a photo- 
cleavable bond such as a charge transfer complex or a stable 
organic radical. Furthermore, the linkage can be formed with 
L' being a quaternary ammonium group (some examples are 
given in FIG. 19). In this case, preferably, the surface of the 
solid support carries negative charges which repel the nega- 
tively charged nucleic acid backbone and thus facilitates 
desorption. Desorption will take place either by the heat 
created by the laser pulse and/or, depending on L,' by 
specific absorption of laser energy which is in resonance 
with the L' chromophore (see, e.g., examples given in FIG. 
19). The functionalities, L and L,' can also form a charge 
transfer complex and thereby form the temporary L — L' 
linkage. Various examples for appropriate functionalities 
with either acceptor or donator properties are depicted 
without limitation in FIG. 20. Since in many cases the 
"charge- transfer band" can be determined by UV/vis spec- 
trometry (see e.g. Organic Charge Transfer Complexes by 
R. Foster, Academic Press, 1969), the laser energy can be 
tuned to the corresponding energy of the charge-transfer 
wavelength and, thus, a specific desorption off the solid 
support can be initiated. Those skilled in the art will recog- 
nize that several combinations can serve this purpose and 
that the donor functionality can be either on the solid support 
or coupled to the nested Sanger DNA/RNA fragments or 
vice versa. 

In yet another approach, the temporary linkage L — L' can 
be generated by homolytically forming relatively stable 
radicals as exemplified in FIG. 21. In example 4 of FIG. 21, 
a combination of the approaches using charge-transfer com- 
plexes and stable organic radicals is shown. Here, the nested 
Sanger DNA/RNA fragments are captured via the formation 
of a charge transfer complex. Under the influence of the laser 
pulse, desorption (as discussed above) as well as ionization 
will take place at the radical position. In the other examples 
of FIG. 21 under the influence of the laser pulse, the L — L' 
linkage will be cleaved and the nested Sanger DNA/RNA 
fragments desorbed and subsequently ionized at the radical 
position formed. Those skilled in the art will recognize that 
other organic radicals can be selected and that, in relation to 
the dissociation energies needed to homolytically cleave the 
bond between them, a corresponding laser wavelength can 
be selected (see e.g. Reactive Molecules by C. Wentrup, 
John Wiley & Sons, 1984). In yet another approach, the 
nested Sanger DNA/RNA fragments are captured via Wat- 
son-Crick base pairing to a solid support-bound oligonucle- 
otide complementary to either the sequence of the nucleic 
acid primer or the tag oligonucleotide sequence (see FIG. 
22). The duplex formed will be cleaved under the influence 
of the laser pulse and desorption can be initiated. The solid 
support-bound base sequence can be presented through 
natural oligoribo- or oligodeoxyribonucleotide as well as 
analogs (e.g. thio-modified phosphodiester or phosphotri- 
ester backbone) or employing oligonucleotide mimetics 
such as PNA analogs (see e.g. Nielsen et al., Science, 254, 
1497 (1991)) which render the base sequence less suscep- 
tible to enzymatic degradation and hence increases overall 
stability of the solid support-bound capture base sequence. 
With appropriate bonds, L — V, a cleavage can be obtained 
directly with- a laser tuned to the energy necessary for bond 
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cleavage. Thus, the immobilized nested Sanger fragments 
can be directly ablated during mass spectrometric analysis. 

To increase mass spectrometric performance, it may be 
necessary to modify the phosphodiester backbone prior to 
MS analysis. This can be accomplished by, for example, 5 
using alpha-thio modified nucleotides for chain elongation 
and termination. With alkylating agents such as akyliodides, 
iodoacetamide, pViodoethanol, 2,3-epoxy-l-propanol (see 
FIG. 10), the monothio phosphodiester bonds of the nested 
Sanger fragments are transformed into phosphotriester 10 
bonds. Multiplexing by mass modification in this case is 
obtained by mass-modifying the nucleic acid primer (UP) or 
the nucleoside triphosphates at the sugar or the base moiety. 
To those skilled in the art, other modifications of the nested 
Sanger fragments can be envisioned. In one embodiment of J5 
the invention, the linking chemistry allows one to cleave off 
the so-purified nested DNA enzymatically, chemically or 
physically. By way of example, the L — L' chemistry can be 
of a type of disulfide bond (chemically cleavable, for 
example, by mercaptoethanol or dithioerythrol), a biolin/ 2Q 
streptavidin system, a heterobifunctional derivative of a 
trityl ether group (Koster et al., "A Versatile Acid-Labile 
Linker for Modification of Synthetic Biomolecules," Tetra- 
hedron Letters 31, 7095 (1990)) which can be cleaved under 
mildly acidic conditions, a levulinyl group cleavable under 25 
almost neutral conditions with a hydrazinium/acetate buffer, 
an arginine-arginine or lysine-lysine bond cleavable by an 
endopepddase enzyme like trypsin or a pyrophosphate bond 
cleavable by a pyrophosphatase, a photocleavable bond 
which can be, for example, physically cleaved and the like 30 
(see, e.g., FIG. 23). Optionally, another cation exchange can 
be performed prior to mass spectrometric analysis. In the 
instance that an enzyme-cleavable bond is utilized to immo- 
bilize the nested fragments, the enzyme used to cleave the 
bond can serve as an internal mass standard during MS 35 
analysis. 

The purification process and/or ion exchange process can 
be carried out by a number of other methods instead of, or 
in conjunction with, immobilization on a solid support. For 
example, the base-specifically terminated products can be 40 
separated from the reactants by dialysis, filtration (including 
ultrafiltration), and chromatography. Likewise, these tech- 
niques can be used to exchange the cation of the phosphate 
backbone with a counter-ion which reduces peak broaden- 
ing. 45 

The base-specifically terminated fragment families can be 
generated by standard Sanger sequencing using the Large 
Klenow fragment of E. coli DNA polymerase I, by Seque- 
nase, Taq DNA polymerase and other DNA polymerases 
suitable for this purpose, thus generating nested DNA frag- 50 
ments for the mass spectrometric analysis. It is, however, 
part of this invention that base-specifically terminated RNA 
transcripts of the DNA fragments to be sequenced can also 
be utilized for mass spectrometric sequence determination. 
In this case, various RNA polymerases such as the SP6 or the 55 
T7 RNA polymerase can be used on appropriate vectors 
containing, for example, the SP6 or the T7 promoters (e.g. 
Axelrod et al, 'Transcription from Bacteriophage T7 and 
SP6 RNA Polymerase Promoters in the Presence of 3"-Deox- 
yribonucleoside 5 '-triphosphate Chain Terminators," Bio- 60 
chemistry 24, 5716-23 (1985)). In this case, the unknown 
DNA sequence fragments are inserted downstream from 
such promoters. Transcription can also be initiated by a 
nucleic acid primer (Pitulle et al., "Initiator Oligonucleotides 
for the Combination of Chemical and Enzymatic RNA 65 
Synthesis," Gene 112, 101-105 (1992)) which carries, as 
one embodiment of this invention, appropriate linking func- 


tionalities, L, which allow the immobilization of the nested 
RNA fragments, as outlined above, prior to mass spectro- 
metric analysis for purification and/or appropriate modifi- 
cation and/or conditioning. 

For this immobilization process of the DNA/RNA 
sequencing products for mass spectrometric analysis, vari- 
ous solid supports can be used, e.g., beads (silica gel, 
controlled pore glass, magnetic beads, Sephadex/Sepharose 
beads, cellulose beads, etc.), capillaries, glass fiber filters, 
glass surfaces, metal surfaces or plastic material. Examples 
of useful plastic materials include membranes in filter or 
microliter plate formats, the latter allowing the automation 
of the purification process by employing microtiter plates 
which, as one embodiment of the invention, carry a perme- 
able membrane in the bottom of the well functionalized with 
L. Membranes can be based on polyethylene, polypropy- 
lene, polyamide, polyvinylidenedifluoride and the like. 
Examples of suitable metal surfaces include steel, gold, 
silver, aluminum, and copper. After purification, cation 
exchange, and/or modification of the phosphodiester back- 
bone of the L — L bound nested Sanger fragments, they can 
be cleaved off the solid support chemically, enzymatically or 
physically. Also, the L— L bound fragments can be cleaved 
from the support when they are subjected to mass spectro- 
metric analysis by using appropriately chosen L — L link- 
ages and corresponding laser energies/intensities as 
described above and in FIGS. 19-23. 

The highly purified, four base-specifically terminated 
DNA or RNA fragment families are then analyzed with 
regard to their fragment lengths via determination of their 
respective molecular weights by MALDI or ES mass spec- 
trometry. 

For ES, the samples, dissolved in water or in a volatile 
buffer, are injected either continuously or discontinuously 
into an atmospheric pressure ionization interface (API) and 
then mass analyzed by a quadrupole. With the aid of a 
computer program, the molecular weight peaks are searched 
for the known molecular weight of the nucleic acid primer 
(UP) and determined which of the four chain-terminating 
nucleotides has been added to the UP. This represents the 
first nucleotide of the unknown sequence. Then, the second, 
the third, the n' H extension product can be identified in a 
similar manner and, by this, the nucleotide sequence is 
assigned. The generation of multiple ion peaks which can be 
obtained using ES mass spectrometry can increase the 
accuracy of the mass determination. 

In MALDI mass spectrometry, various mass analyzers 
can be used, e.g., magnetic sector/magnetic deflection 
instruments in single or triple quadrupole mode (MS/MS), 
Fourier transform and time-of-flight (TOP) configurations as 
is known in the art of mass spectrometry. FIGS. 2 A through 
6 are given as an example of the data obtainable when 
sequencing a hypothetical DNA fragment of 50 nucleotides 
in length (SEQ ID NO: 3) and having a molecular weight of 
15,344.02 daltons. The molecular weights calculated for the 
ddT (FIGS. 2A and 2B), ddA (FIGS. 3 A and 3B), ddG 
(FIGS. 4A and 4B) and ddC (FIGS. 5A and 5B) terminated 
products are given (corresponding to fragments of SEQ ID 
NO:3) and the idealized four MALDI-TOF mass spectra 
shown. All four spectra are superimposed, and from this, the 
DNA sequence can be generated. This is shown in the 
summarizing FIG. 6, demonstrating how the molecular 
weights are correlated with the DNA sequence. MALDI- 
TOF spectra have been generated for the ddT terminated 
products (FIG. 16) corresponding to those shown in FIG. 2 
and these spectra have been superimposed (FIG. 17). The 
correlation of calculated molecular weights of the ddT 
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fragments and their experimentally-verified weights are 
shown in Table I. Likewise, if all four chain-terminating 
reactions are combined and then analyzed by mass spec- 
trometry, the molecular weight difference between two adja- 
cent peaks can be used to determine the sequence. For the 
desorption/ionization process, numerous matrix/laser com- 
binations can be used. 

TABLE I 

Correlation of calculated and experimentally verified molecular 
weights of the 13 DNA fragments of FIGS. 2 and 16. 
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(n-mer) 

calculated mass 

experimental mass 

difference 

7-mer 

2104.45 

2119.9 

+15.4 

10-mer 

3011.04 

3026.1 

+15.1 

11-mer 

3315.24 

3330.1 

+14.9 

19 mer 

5771.82 

5788.0 

+16.2 

20-mer 

6076.02 

6093.8 

+17.8 

24- mer 

7311.82 

7374.9 

+63.1 

26- mer 

7945.22 

7960.9 

+15.7 

33-mer 

10112.63 

10125.3 

+12.7 

37-mer 

11348.43 

11361.4 

+13.0 

38-mer 

11652.62 

11670.2 

+17.6 

42-mer 

12872,42 

12888.3 

+15.9 

46-mer 

14108.22 

14125.0 

+16.8 

50-mer 

15344.02 

15362.6 

+18.6 


In order to increase throughput to a level necessary for 
high volume genomic and cDNA sequencing projects, a 
further embodiment of the present invention is to utilize 
multiplex mass spectrometry to simultaneously determine 30 
more than one sequence. This to can be achieved by several, 
albeit different, methodologies, the basic principle being the 
mass modification of the nucleic acid primer (UP), the 
chain-elongating and/or terminating nucleoside triphos- 
phates, or by using mass-differentiated tag probes hybridiz- 
able to specific tag sequences. The term "nucleic acid 35 
primer" as used herein encompasses primers for both DNA 
and RNA Sanger sequencing. 

By way of example, FIG. 7 presents a general formula of 
the nucleic acid primer (UP) and the tag probes (TP). The 
mass modifying moiety can be attached, for instance, to 40 
either the 5'-end of the oligonucleotide (M 1 ), to the nucleo- 
base (or bases) (M 2 , M 7 ), to the phosphate backbone (M 3 ), 
and to the 2-position of the nucleoside (nucleosides) (M 4 , 
M 6 ) or/and to the terminal 3 '-position (M 5 ). Primer length 
can vary between 1 and 50 nucleotides in length. For the 45 
priming of DNA Sanger sequencing, the primer is prefer- 
entially in the range of about 15 to 30 nucleotides in length. 
For artificially priming the transcription in a RNA poly- 
merase-mediated Sanger sequencing reaction, the length of 
the primer is preferentially in the range of about 2 to 6 50 
nucleotides. If a tag probe (TP) is to hybridize to the 
integrated tag sequence of a family chain-terminated frag- 
ments, its preferential length is about 20 nucleotides. 

The table in FIG. 7 depicts some examples of mass- 
modified primer/tag probe configurations for DNA, as well 55 
as RNA, Sanger sequencing. This list is, however, not meant 
to be limiting, since numerous other combinations of mass- 
modifying functions and positions within the oligonucle- 
otide molecule are possible and are deemed part of the 
invention. The mass-modifying functionality can be, for go 
example, a halogen, an azido, or of the type, XR, wherein X 
is a linking group and R is a mass-modifying functionality. 
The mass-modifying functionality can thus be used to intro- 
duce defined mass increments into the oligonucleotide mol- 
ecule. 65 

In another embodiment, the nucleotides used for chain- 
elongation and/or termination are mass-modified. Examples 


of such modified nucleotides are shown in FIG. 8. Here the 
mass-modifying moiety, M, can be attached either to the 
nucleobase, M (in case of the c 7 -deazanucleosides also to 
C-7, M 7 ), to the triphosphate group at the alpha phosphate, 
M 3 , or to the 2'position of the sugar ring of the nucleoside 
triphosphate, M and M 6 . Furthermore, the mass-modifying 
functionality can be added so as to affect chain termination, 
such as by attaching it to the 3'-position of the sugar ring in 
the nucleoside triphosphate, M 5 . The list in FIG. 8 represents 
examples of possible configurations for generating chain- 
terminating nucleoside triphosphates for RNA or DNA 
Sanger sequencing. For those skilled in the art, however, it 
is clear that many other combinations can serve the purpose 
of the invention equally well. In the same way, those skilled 
in the art will recognize that chain-elongating nucleoside 
triphosphates can also be mass-modified in a similar fashion 
with numerous variations and combinations in functionality 
and attachment positions. 

Without limiting the scope of the invention, FIG. 9 gives 
a more detailed description of particular examples of how 
the mass-modification, M, can be introduced for X in XR as 
well as using oligoVpolyethylene glycol derivatives for R. 
The mass-modifying increment in this case is 44, i.e. five 
different mass-modified species can be generated by just 
changing m from 0 to 4 thus adding mass units of 45 (m=0), 
89 (m=l), 133 (m=2), 177 (m=3) and 221 (m=4) to the 
nucleic acid primer (UP), the tag probe (TP) or the nucleo- 
side triphosphates respectively. The oligo/polyethylene gly- 
cols can also be monoalkylated by a lower alkyl such as 
methyl, ethyl, propyl, isopropyl, t-butyl and the like. A 
selection of linking functionalities, X, are also illustrated. 
Other chemistries can be used in the mass-modified com- 
pounds, as for example, those described recently in Oligo- 
nucleotides and Analogues, A Practical Approach, F. Eck- 
stein, editor, IRL Press, Oxford, 1991. 

In yet another embodiment, various mass-modifying 
functionalities, R, other than oligo/polyethylene glycols, can 
be selected and attached via appropriate linking chemistries, 
X. Without any limitation, some examples are given in FIG. 
10. A simple mass-modification can be achieved by substi- 
tuting H for halogens like F, CI, Br and/or I, or pseudohalo- 
gens such as SCN, NCS, or by using different alkyl, aryl or 
aralkyl moieties such as methyl, ethyl, propyl, isopropyl, 
t-butyl, hexyl, phenyl, substituted phenyl, benzyl, or func- 
tional groups such as CH 2 F, CHF 2 , CF 3 , Si(CH 3 ) 3 , 
Si(CH 3 ) 2 (C 2 H 5 ), Si(CH 3 )(C 2 H 5 ) 2 , Si(C 2 H s ) 3 . Yet another 
mass-modification can be obtained by attaching homo- or 
heteropeptides through X to the UP, TP or nucleoside 
triphosphates. One example useful in generating mass-modi- 
fied species with a mass increment of 57 is the attachment 
of oligoglycines, e.g., mass-modifications of 74 (r=l, m=0), 
131 (r=l, m=2), 188 (r-1, m=3), 245 (r=l, m=4) are 
achieved. Simple oligoamides also can be used, e.g., mass- 
modifications of 74 (r=l, m=0), 88 (r=2, m=0), 102 (r=3, 
m=0), 116 (r=4, m==0), etc. are obtainable. For those skilled 
in the art, it will be obvious that there are numerous 
possibilities in addition to those given in FIG. 10 and the 
above mentioned reference (Oligonucleotides and Ana- 
logues, F. Eckstein, 1991), for introducing, in a predeter- 
mined manner, many different mass-modifying functional- 
ities to UP, TP and nucleoside triphosphates which are 
acceptable for DNA and RNA Sanger sequencing. 

As used herein, the superscript 0-i designates i+1 mass 
differentiated nucleotides, primers or tags. In some 
instances, the superscript 0 (e.g., NTP°, UP 0 ) can designate 
an unmodified species of a particular reactant, and the 
superscript i (e.g., NTP\ NTP 1 , NTP 2 , etc.) can designate 
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the i-th mass-modified species of that reactant. If, for 
example, more than one species of nucleic acids (e.g., DNA 
clones) are to be concurrently sequenced by multiplex DNA 
sequencing, then i+1 different mass-modified nucleic acid 
primers (UP 0 , UP 1 , . . . UP') can be used to distinguish each 5 
set of base-specifically terminated fragments, wherein each 
species of mass-modified UP 1 can be distinguished by mass 
spectrometry from the rest. 

As illustrative embodiments of this invention, three dif- 
ferent basic processes for multiplex mass spectrometric 10 
DNA sequencing employing the described mass-modified 
reagents are described below: 

A) Multiplexing by the use of mass-modified nucleic acid 
primers (UP) for Sanger DNA or RNA sequencing (see 
for example FIG. 11); 15 

B) Multiplexing by the use of mass-modified nucleoside 
triphosphates as chain elongators and/or chain termi- 
nators for Sanger DNA or RNA sequencing (see for 
example FIG. 12); and 

C) Multiplexing by the use of tag probes which specifi- 20 
cally hybridize to tag sequences which are integrated 
into part of the four Sanger DNA/RNA base-specifi- 
cally terminated fragment families. Mass modification 
here can be achieved as described for FIGS. 7, 9 and 10, 

or alternately, by designing different oligonucleotide 25 
sequences having the same or different length with 
unmodified nucleotides which, in a predetermined way, 
generate appropriately differentiated molecular weights 
(see for example FIG. 13). 
The process of multiplexing by mass-modified nucleic 30 
acid primers (UP) is illustrated by way of example in FIG. 
11 for mass analyzing four different DNA clones simulta- 
neously. The first reaction mixture is obtained by standard 
Sanger DNA sequencing having unknown DNA fragment 1 
(clone 1) integrated in an appropriate vector (e.g., 35 
M13mpl8), employing an unmodified nucleic acid primer 
UP 0 , and a standard mixture of the four unmodified deoxy- 
nucleoside triphosphates, dNTP 0 , and with Vioth of one of 
the four dideoxynucleoside triphosphates, ddNTP°. A sec- 
ond reaction mixture for DNA fragment 2 (clone 2) is 40 
obtained by employing a mass-modified nucleic acid primer 
UP 1 and, as before, the four unmodified nucleoside triph- 
osphates, dNTP 0 , containing in each separate Sanger reac- 
tion 1 / 10th of the chain-terminating unmodified dideoxy- 
nucleoside triphosphates ddNTP 0 . In the other two 45 
experiments, the four Sanger reactions have the following 
compositions: DNA fragment 3 (clone 3), UP 2 , dNTP 0 , 
ddNTP 0 and DNA fragment 4 (clone 4), UP 3 , dNTP 0 , 
ddNTP°. For mass spectrometric DNA sequencing, all base- 
specifically terminated reactions of the four clones are 50 
pooled and mass analyzed. The various mass peaks belong- 
ing to the four dideoxy-terminated (e.g., ddT-terminated) 
fragment families are assigned to specifically elongated and 
ddT-terminated fragments by searching (such as by a com- 
puter program) for the known molecular ion peaks of UP°, 55 
UP 1 , UP 2 and UP 3 extended by either one of the four 
dideoxynucleoside triphosphates, UF^ddN 0 , UP^ddN 0 , 
UP 2 -ddN° and UP 3 -ddN°. In this way, the first nucleotides 
of the four unknown DNA sequences of clone 1 to 4 are 
determined. The process is repeated, having memorized the 60 
molecular masses of the four o specific first extension 
products, until the four sequences are assigned. Unambigu- 
ous mass/sequence assignments are possible even in the 
worst case scenario in which the four mass-modified nucleic 
acid primers are extended by the same dideoxynucleoside 65 
triphosphate, the extension products then being, for 
example, UP°-ddT, UP 1 -ddT, UP 2 -ddT and UP 3 -ddT, which 


differ by the known mass increment differentiating the four 
nucleic acid primers. In another embodiment of this inven- 
tion, an analogous technique is employed using different 
vectors containing, for example, the SP6 and/or T7 promoter 
sequences, and performing transcription with the nucleic 
acid primers UP 0 , UP 1 , UP 2 and UP 3 and either an RNA 
polymerase (e.g., SP6 or T7 RNA polymerase) with chain- 
elongating and terminating unmodified nucleoside triphos- 
phates NTP° and 3'-dNTP°. Here, the DNA sequence is 
being determined by Sanger RNA sequencing. 

FIG. 12 illustrates the process of multiplexing by mass- 
modified chain-elongating or/and terminating nucleoside 
triphosphates in which three different DNA fragments (3 
clones) are mass analyzed simultaneously. The first DNA 
Sanger sequencing reaction (DNA fragment 1, clone 1) is the 
standard mixture employing unmodified nucleic acid primer 
UP 0 , dNTP 0 and in each of the four reactions one of the four 
ddNTP 0 . The second (DNA fragment 2, clone 2) and the 
third (DNA fragment 3, clone 3) have the following con- 
tents: UP°, dNTP°, ddNTP 1 and UP", dNTP°, ddNTP 2 , 
respectively. In a variation of this process, an amplification 
of the mass increment in mass-modifying the extended DNA 
fragments can be achieved by either using an equally 
mass-modified deoxynucleoside triphosphate (i.e., dNTP 1 , 
dNTP 2 ) for chain elongation alone or in conjunction with the 
homologous equally mass- modified dideoxynucleoside 
triphosphate. For the three clones depicted above, the con- 
tents of the reaction mixtures can be as follows: either 
UP 0 /dNTP 0 /ddNTP°, UP 0 /dNTPVddNTP° and UP°/dNTP 2 / 
ddNTP 0 or UI*7dNTP°/ddNTP 0 , UpO/dNTPVddNTP 1 and 
UP°/dNTP 2 /ddNTP 2 . As described above, DNA sequencing 
can be performed by Sanger RNA sequencing employing 
unmodified nucleic acid primers, UP 0 , and an appropriate 
mixture of chain-elongating and terminating nucleoside 
triphosphates. The mass-modification can be again either in 
the chain-terminating nucleoside triphosphate alone or in 
conjunction with mass-modified chain-elongating nucleo- 
side triphosphates. Multiplexing is achieved by pooling the 
three base-specifically terminated sequencing reactions 
(e.g., the ddTTP terminated products) and simultaneously 
analyzing the pooled products by mass spectrometry. Again, 
the first extension products of the known nucleic acid primer 
sequence are assigned, e.g., via a computer program. Mass/ 
sequence assignments are possible even in the worst case in 
which the nucleic acid primer is extended/terminated by the 
same nucleotide, e.g., ddT, in all three clones. The following 
configurations thus obtained can be well differentiated by 
their different mass-modifications: UP°-ddT°, UP^ddT 1 , 
UP°-ddT 2 

In yet another embodiment of this invention, DNA 
sequencing by multiplex mass spectrometry can be achieved 
by cloning the DNA fragments to be sequenced in "plex- 
vectors" containing vector specific "tag sequences" as 
described (Koster et al., "Oligonucleotide Synthesis and 
Multiplex DNA Sequencing Using Chemiluminescent 
Detection," Nucleic Acids Res. Symposium Ser. No. 24, 
318-321 (1991)); then pooling clones from different plex- 
vectors for DNA preparation and the four separate Sanger 
sequencing reactions using standard dNTP a /ddNTP° and 
nucleic acid primer UP°; purifying the four multiplex frag- 
ment families via linking to a solid support through the 
linking group, L, at the 5'-end of UP; washing out all 
by-products, and cleaving the purified multiplex DNA frag- 
ments off the support or using the L — L 1 bound nested 
Sanger fragments as such for mass spectrometric analysis as 
described above; performing demultiplexing by one-by-one 
hybridization of specific "tag probes"; and subsequently o 
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analyzing by mass spectrometry (see, for example, FIG. 13). 
As a reference point, the four base-specifically terminated 
multiplex DNA fragment families are run by the mass 
spectrometer and all ddT 0 -, ddA°-, ddC°- and ddG°-termi- 
nated molecular ion peaks are respectively detected and 5 
memorized. Assignment of, for example, ddT°-terminated 
DNA fragments to a specific fragment family is accom- 
plished by another mass spectrometric analysis after hybrid- 
ization of the specific tag probe (TP) to the corresponding 
tag sequence contained in the sequence of this specific io 
fragment family. Only those molecular ion peaks which are 
capable of hybridizing to the specific tag probe are shifted to 
a higher molecular mass by the same known mass increment 
(e.g. of the tag probe). These shifted ion peaks, by virtue of 
all hybridizing to a specific tag probe, belong to the same is 
fragment family. For a given fragment family, this is 
repeated for the remaining chain terminated fragment fami- 
lies with the same tag probe to assign the complete DNA 
sequence. This process is repeated i-1 times corresponding 
to i clones multiplexed (the i-th clone is identified by 20 
default). 

The differentiation of the tag probes for the different 
multiplexed clones can be obtained just by the DNA 
sequence and its ability to Watson-Crick base pair to the tag 
sequence. It is well known in the art how to calculate 25 
stringency conditions to provide for specific hybridization of 
a given tag probe with a given tag sequence (see, for 
example, Molecular Cloning: A laboratory manual 2ed, ed. 
by Sambrook, Fritsch and Maniatis (Cold Spring Harbor 
Laboratory Press: N.Y., 1989, Chapter 11). Furthermore, 30 
differentiation can be obtained by designing the tag 
sequence for each plex-vector to have a sufficient mass 
difference so as to be unique just by changing the length or 
base composition or by mass-modifications according to 
FIGS. 7, 9 and 10. In order to keep the duplex between the 35 
tag sequence and the tag probe intact during mass spectro- 
metric analysis, it is another embodiment of the invention to 
provide for a covalent attachment mediated by, for example, 
photoreactive groups such as psoralen and ellipticine and by 
other methods known to those skilled in the art (see, for 40 
example, Helene et al., Nature 344, 358 (1990) and Thuong 
et al. "Oligonucleotides Attached to Intercalators, Photore- 
active and Cleavage Agents" in F. Eckstein, Oligonucle- 
otides and Analogues: A Practical Approach, IRL Press, 
Oxford 1991, 283-306). 45 

The DNA sequence is unraveled again by searching for 
the lowest molecular weight molecular ion peak correspond- 
ing to the known UP°-tag sequence/tag probe molecular 
weight plus the first extension product, e.g., ddT°, then the 
second, the third, etc. 50 

In a combination of the latter approach with the previ- 
ously described multiplexing processes, a further increase in 
multiplexing can be achieved by using, in addition to the tag 
probe/tag sequence interaction, mass-modified nucleic acid 
primers (FIG. 7) and/or mass-modified deoxynucleoside, 55 
dNTP 0 " 1 , and/or dideoxy nucleoside triphosphates, ddNTP 0 '. 
Those skilled in the art will realize that the tag sequence/tag 
probe multiplexing approach is not limited to Sanger DNA 
sequencing generating nested DNA fragments with DNA 
polymerases. The DNA sequence can also be determined by 60 
transcribing the unknown DNA sequence from appropriate 
promoter-containing vectors (see above) with various RNA 
polymerases and mixtures of NTP°"73'-dNTP° thus gen- 
erating nested RNA fragments. 

In yet another embodiment of this invention, the mass- 65 
modifying functionality can be introduced by a two or 
multiple step process. In this case, the nucleic acid primer, 
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the chain-elongating or tenriinating nucleoside triphosphates 
and/or the tag probes are, in a first step, modified by a 
precursor functionality such as azido, — N 3 , or modified 
with a functional group in which the R in XR is H (FIG. 7, 
9) thus providing temporary functions, e.g., but not limited 
to —OH, — NH 2 , — NHR, — SH, — NCS, 
— OCO(CH2) r COOH (r=l-20), — NHCO(CH 2 ) r COOH 
(r-1-20), — OS0 2 OH, — OCO(CH 2 ),I (r=l-20), — OP(0- 
Alkyl)N(Alkyl) 2 . These less bulky functionalities result in 
better substrate properties for the enzymatic DNA or RNA 
synthesis reactions of the DNA sequencing process. The 
appropriate mass-modifying functionality is then introduced 
after the generation of the nested base-specifically termi- 
nated DNA or RNA fragments prior to mass spectrometry. 
Several examples of compounds which can serve as mass- 
modifying functionalities are depicted in FIGS. 9 and 10 
without limiting the scope of this invention. 

Another aspect of this invention concerns kits for 
sequencing nucleic acids by mass spectrometry which 
include combinations of the above-described sequencing 
reactants. For instance, in one embodiment, the kit com- 
prises reactants for multiplex mass spectrometric sequenc- 
ing of several different species of nucleic acid. The kit can 
include a solid support having a linking functionality (L 1 ) 
for immobilization of the base-specifically terminated prod- 
ucts; at least one nucleic acid primer having a linking group 
(L) for reversibly and temporarily linking the primer and 
solid support through, for example, a photocleavable bond; 
a set of chain-elongating nucleotides (e.g., dATP, dCTP, 
dGTP and dTTP, or ATP, CTP, GTP and UTP); a set of 
chain-terminating nucleotides (such as 2',3'-dideoxynucle- 
otides for DNA synthesis or 3-deoxynucleotides for RNA 
synthesis); and an appropriate polymerase for synthesizing 
complementary nucleotides. Primers and/or terminating 
nucleotides can be mass-modified so that the base-specifi- 
cally terminated fragments generated from one of the spe- 
cies of nucleic acids to be sequenced can be distinguished by 
mass spectrometry from all of the others. Alternative to the 
use of mass-modified synthesis reactants, a set of tag probes 
(as described above) can be included in the kit. The kit can 
also include appropriate buffers as well as instructions for 
performing multiplex mass spectrometry to concurrently 
sequence multiple species of nucleic acids. 

In another embodiment, a nucleic acid sequencing kit can 
comprise a solid support as described above, a primer for 
initiating synthesis of complementary nucleic acid frag- 
ments, a set of chain-elongating nucleotides and an appro- 
priate polymerase. The mass-modified chain-terminating 
nucleotides are selected so that the addition of one of the 
chain terminators to a growing complementary nucleic acid 
can be distinguished by mass spectrometry. 

EXAMPLE 1 

Immobilization of Primer-Extension Products of Sanger 
DNA Sequencing Reaction For Mass Spectrometric Analy- 
sis Via Disulfide Bonds 

As a solid support, Sequelon membranes (Millipore 
Corp., Bedford, Mass.) with phenyl isothiocyanate groups 
are used as a starting material. The membrane disks, with a 
diameter of 8 mm, are wetted with a solution of N-methyl- 
morpholine/water/2-propanol (NMM solution) (2/49/49 v/v/ 
v), the excess liquid removed with filter paper and placed on 
a piece of plastic film or alurriinum foil located on a heating 
block set to 55° C. A solution of 1 mM 2-mercaptoethy- 
lamine (cysteamine) or 2,2'-dithio-bis(ethylamine) (cysta- 
mine) or S-(2-thiopyridyl)-2-thio-emylamine (10 ul, 10 
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nmol) in NMM is added per disk and heated at 55° C. After 
15 rain, 10 ul of NMM solution are added per disk and 
heated for another 5 min. Excess of isothiocyanate groups 
may be removed by treatment with 10 ul of a 10 mM 
solution of glycine in NMM solution. For cystamine, the 5 
disks are treated with 10 ul of a solution of 1M aqueous 
dithiothreitol (DTT)/2-propanol (1:1 v/v) for 15 min at room 
temperature. Then, the disks are thoroughly washed in a 
filtration manifold with 5 aliquots of 1 ml each of the NMM 
solution, then with 5 aliquots of 1 ml acetonitrile/water (1/1 
v/v) and subsequently dried. If not used immediately the 
disks are stored with free thiol gxoups in a solution of 1M 
aqueous dithiothreitol/2-propanol (1:1 v/v) and, before use, 
DTT is removed by three washings with 1 ml each of the 
NMM solution. The primer oligonucleotides with 5'-SH 
functionality can be prepared by various methods (e.g. , B. C. 15 
F. Chu et al., Nucleic Acids Res. 14, 5591-5603 (1986), 
Sproat et al., Nucleic Acids Res. 15, 4837^8 (1987) and 
Oligonucleotides and Analogues: A Practical Approach (F. 
Eckstein, editor), IRL Press Oxford, 1991). Sequencing 
reactions according to the Sanger protocol are performed in 20 
a standard way (e.g., H. Swerdlow et al., Nucleic Acids Res. 
18, 1415-19 (1990)). In the presence of about 7-10 mM 
DTT the free 5'-thiol primer can be used; in other cases, the 
SH functionality can be protected, e.g., by a trityl group 
during the Sanger sequencing reactions and removed prior to 25 
anchoring to the support in the following way. The four 
sequencing reactions (150 ul each in an Eppendorf tube) are 
terminated by a 10 min incubation at 70° C. to denature the 
DNA polymerase (such as Klenow fragment, Sequenase) 
and the reaction mixtures are ethanol precipitated. The 30 
supernatants are removed and the pellets vortexed with 25 ul 
of an 1M aqueous silver nitrate solution, and after one hour 
at room temperature, 50 ul of an 1M aqueous solution of 
DTT is added and mixed by vortexing. After 15 min, the 
mixtures are centrifuged and the pellets are washed twice 35 
with 100 ul ethylacetate by vortexing and centrifugation to 
remove excess DTT. The primer extension products with 
free 5'-thiol group are now coupled to the thiolated mem- 
brane supports under mild oxidizing conditions. In general, 
it is sufficient to add the 5'-thiolated primer extension 40 
products dissolved in 10 ul 10 mM de-aerated triethylam- 
monium acetate buffer (TEAA) pH 7.2 to the thiolated 
membrane supports. Coupling is achieved by drying the 
samples onto the membrane disks with a cold fan. This 
process can be repeated by wetting the membrane with 10 ul 45 
of 10 mM TEAA buffer pH 7.2 and drying as before. When 
using the 2-thiopyridyl derivatized compounds, anchoring 
can be monitored by the release of pyridine-2-thione spec- 
trophotometrically at 343 nm. 

In another variation of this approach, the oligonucleotide 50 
primer is functionalized with an amino group at the 5'-end 
which is introduced by standard procedures during auto- 
mated DNA synthesis. After primer extension, during the 
Sanger sequencing process, the primary amino group is 
reacted with 3-(2-pyridyldithio) propionic acid N-hydrox- 55 
ysuccinimide ester (SPDP) and subsequently coupled to the 
thiolated supports and monitored by the release of pyridyl- 
2-thione as described above. After denaturation of DNA 
polymerase and ethanol precipitation of the sequencing 
products, the supernatants are removed and the pellets 60 
dissolved in 10 ul 10 mM TEAA buffer pH 7.2 and 10 ul of 
a 2 mM solution of SPDP in 10 mM TEAA are added. The 
reaction mixture is vortexed and incubated for 30 min at 25° 
C. Excess SPDP is then removed by three extractions 
(vortexing, centrifugation) with 50 ul each of ethanol and the 65 
resulting pellets are dissolved in 10 ul 10 mM TEAA buffer 
pH 7.2 and coupled to the thiolated supports (see above). 
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The primer-extension products are purified by washing 
the membrane disks three times each with 100 ul NMM 
solution and three times with 100 ul each of 10 mM TEAA 
buffer pH 7.2. The purified primer-extension products are 
removed by three successive treatments with 10 ul of 10 mM 
2-mercaptoethanol in 10 mM TEAA buffer pH 7.2, lyo- 
philized and analyzed by either ES or MALD1 mass spec- 
trometry. 

This procedure can also be used for the mass-modified 
nucleic acid primers UP 0 "* in an analogous and appropriate 
way, taking into account the chemical properties of the 
mass-modifying functionalities. 

EXAMPLE 2 

Immobilization of Primer-Extension Products of Sanger 
DNA Sequencing Reaction for Mass Spectrometric Analysis 
Via the Levulinyl Group 

5- Aminolevulinic acid is protected at the primary amino 
group with the Fmoc group using 9-fluorenylmethyl N-suc- 
cinimidyl carbonate and is then transformed into the N-hy- 
droxysucdriimide ester (NHS ester) using N-hydroxysuc- 
cinimide and dicyclohexyl carbodiimide under standard 
conditions. For the Sanger sequencing reactions, nucleic 
acid primers, UP 0 " 1 , are used which are functionalized with 
a primary ammo group at the 5 '-end introduced by standard 
procedures during automated DNA synthesis with araino- 
linker phosphoamidites as the final synthetic step. Sanger 
sequencing is performed under standard conditions (see 
above). The four reaction mixtures (150 ul each in an 
Eppendorf tube) are heated to 70° C. for 10 min to inactivate 
the DNA polymerase, ethanol precipitated, centrifuged and 
resuspended in 10 ul of 10 mM TEAA buffer pH 7.2. 10 ul 
of a 2 mM solution of the Fmoc-5-aminolevulinyl-NHS 
ester in 10 mM TEAA buffer is added, vortexed and incu- 
bated at 25° C. for 30 min. The excess of the reagent is 
removed by ethanol precipitation and centrifugation. The 
Fmoc group is cleaved off by resuspending the pellets in 10 
ul of a solution of 20% piperidine in N,N-dimethyIforma- 
mide/water (1:1 v/v). After 15 min at 25° C, piperidine is 
thoroughly removed by three precipitations/centrifugations 
with 100 ul each of ethanol, the pellets are resuspended in 
10 ul of a solution of N-methylmorpholme, 2-propanol and 
water (2/10/88 v/v/v) and are coupled to the solid support 
carrying an isothiocyanate group. In the case of the DITC- 
Sequelon membrane (Millipore Corp., Bedford, Mass.), the 
membranes are prepared as described in EXAMPLE 1 and 
coupling is achieved on a heating block at 55° C. as 
described above. RNA extension products are immobilized 
in an analogous way. The procedure can be applied to other 
solid supports with isothiocyanate groups in a similar man- 
ner. 

The immobilized primer-extension products are exten- 
sively washed three times with 100 ul each of NMM solution 
and three times with 100 ul 1 0 mM TEAA buffer pH 7.2. The 
purified primer-extension products are released by three 
successive treatments with 10 ul of 100 mM hydrazinium 
acetate buffer pH 6.5, lyophilized and analyzed by either ES 
or MALDI mass spectrometry. 

EXAMPLE 3 

Immobilization of Primer-Extension Products of Sanger 
DNA Sequencing Reaction for Mass Spectrometric Analysis 
Via a Trypsin Sensitive Linkage 

Sequelon DITC membrane disks of 8 mm diameter (Mil- 
lipore Corp., Bedford, Mass.) are wetted with 10 ul of NMM 
solution (N-methylmorpholine/propanaol-2/water; 2/49/49 
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v/v/v) and a linker arm introduced by reaction with 10 ul of 
a 10 mM solution of 1,6-diaminohexane in NMM. The 
excess diamine is removed by three washing steps with 100 
ul of NMM solution. Using standard peptide synthesis 
protocols, two L-lysine residues are attached by two sue- 5 
cessive condensations with N-Fmoc-N-tBoc-L-lysine pen- 
tafluorophenylester, the terminal Fmoc group is removed 
with piperidine in NMM and the free oc-amino group 
coupled to 1,4-phenylene diisothiocyanate (DITC). Excess 
DITC is removed by three washing steps with 100 ul 10 
2-propanol and the N-tBoc groups removed with trifluoro- 
acetic acid according to standard peptide synthesis proce- 
dures. The nucleic acid primer-extension products are pre- 
pared from oligonucleotides which carry a primary amino 
group at the 5'-terminus. The four Sanger DNA sequencing 15 
reaction mixtures (150 ul each in Eppendorf tubes) are 
heated for 10 min at 70° C. to inactivate the DNA poly- 
merase, ethanol precipitated, and the pellets resuspended in 
10 ul of a solution of N-methylmorpholine, 2-propanol and 
water (2/10/88 v/v/v). This solution is transferred to the 20 
Lys-Lys-DITC membrane disks and coupled on a heating 
block set at 55° C. After drying, 10 ul of NMM solution is 
added and the drying process repeated. 

The immobilized primer-extension products are exten- 
sively washed three times with 1 00 ul each of NMM solution 25 
and three times with 100 ul each of 10 mM TEAA buffer pH 
7.2. For mass spectrometric analysis, the bond between the 
primer-extension products and the solid support is cleaved 
by treatment with trypsin under standard conditions and the 
released products analyzed by either ES or MALDI mass 30 
spectrometry with trypsin serving as an internal mass stan- 
dard. 

EXAMPLE 4 

Immobilization of Primer-Extension Products of Sanger 35 
DNA Sequencing Reaction for Mass Spectrometric Analysis 
Via Pyrophosphate Linkage 

The DITC Sequelon membrane (disks of 8 mm diameter) 
are prepared as described in EXAMPLE 3 and 10 ul of a 10 
mM solution of 3-aminopyridme adenine dinucleotide 40 
(APAD) (Sigma) in NMM solution added. The excess APAD 
is removed by a 10 ul wash of NMM solution and the disks 
are treated with 10 ul of 10 mM sodium periodate in NMM 
solution (15 min, 25° C). Excess periodate is removed and 
the primer-extension products of the four Sanger DNA 
sequencing reactions (150 ul each in Eppendorf tubes) 
employing nucleic acid primers with a primary amino group 
at the 5'-end are ethanol precipitated, dissolved in 10 ul of 
a solution of N-methyImorpholine/2-propanol/water (2/10/ 
88 v/v/v) and coupled to the 2'3-dialdehydo groups of the 50 
immobilized NAD analog. 

The primer-extension products are extensively washed 
with the NMM solution (3 times with 100 ul each) and 10 
mM TEAA buffer pH 7.2 (3 times with 100 ul each) and the 
purified primer-extension products are released by treatment 
with either NADase or pyrophosphatase in 10 mM TEAA 
buffer at pH 7.2 at 37° C. for 15 min, lyophilized and 
analyzed by either ES or MALDI mass spectrometry, the 
enzymes serving as internal mass standards. fiQ 

EXAMPLE 5 

Synthesis of Nucleic Acid Primers Mass-Modified by Gly- 
cine Residues at the 5-Position of the Sugar Moiety of the 
Terminal Nucleoside 65 

Oligonucleotides are synthesized by standard automated 
DNA synthesis using j5-cyanoethylphosphoamidites (H. K 


oster et al., Nucleic Acids Res. 12, 4539 (1984)) and a 
5 '-amino group is introduced at the end of solid phase DNA 
synthesis (e.g. Agrawal et al., Nucleic Acids Res. 14, 
6227-45 (1986) or Sproat et al., Nucleic Acids Res. 15, 
6181-96 (1987)). The total amount of an oligonucleotide 
synthesis, starting with 0.25 umol CPG-bound nucleoside, is 
deprotected with concentrated aqueous ammonia, purified 
via OligoPAK™ Cartridges (Millipore Corp., Bedford, 
Mass.) and lyophilized. This material with a 5'-terminal 
amino group is dissolved in 100 ul absolute N,N-dimethyl- 
formamide (DMF) and condensed with 10 umole N-Fmoc- 
glycine pentafluorophenyl ester for 60 min at 25° C. After 
ethanol precipitation and centrifugation, the Fmoc group is 
cleaved off by a 10 min treatment with 100 ul of a solution 
of 20% piperidine in N,N-dimethylformamide. Excess pip- 
eridine, DMF and the cleavage product from the Fmoc group 
are removed by ethanol precipitation and the precipitate 
lyophilized from 10 mM TEAA buffer pH 7.2. This material 
is now either used as primer for the Sanger DNA sequencing 
reactions or one or more glycine residues (or other suitable 
protected amino acid active esters) are added to create a 
series of mass-modified primer oligonucleotides suitable for 
Sanger DNA or RNA sequencing. Immobilization of these 
mass-modified nucleic acid primers UP°^ after primer- 
extension during the sequencing process can be achieved as 
described, e.g., in EXAMPLES 1 to 4. 

EXAMPLE 6 

Synthesis of Nucleic Acid Primers Mass-Modified at C-5 of 
the Heterocyclic Base of a Pyrimidine Nucleoside with 
Glycine Residues 

Starting material was 5-(3-aminopropynyl-l)-3'5'-di-p- 
tolyldeoxyuridine prepared and 3'5'-de-0-acylated accord- 
ing to literature procedures (Haralambidis et al., Nucleic 
Acids Res. 15, 4857-76 (1987)). 0.281 g (1.0 mmole) 
5-(3-aminopropynyl-l)- 2'-deoxyuridine were reacted with 
0.927 g (2.0 mmole) N-Fmoc-glycine pentafluoropheny- 
lester in 5 ml absolute N,N-dimethylformamide in the pres- 
ence of 0.129 g (1 mmole; 174 ul) N.N-diisopropylethy- 
lamine for 60 min at room temperature. Solvents were 
removed by rotary evaporation and the product was purified 
by silica gel chromatography (Kieselgei 60, Merck; column: 
2.5x50 cm, elution with chloroform/methanol mixtures). 
Yield was 0.44 g (0.78 mmole, 78%). In order to add another 
glycine residue, the Fmoc group is removed with a 20 min 
treatment with 20% solution of piperidine in DMF, evapo- 
rated in vacuo and the remaining solid material extracted 
three times with 20 ml ethylacetate. After having removed 
the remaining ethylacetate, N-Fmoc-glycine pentafiuo- 
rophenylester is coupled as described above. 5-(3 -(N-Fmoc- 
glycyl)-amidopropynyl-l)-2'-deoxy uridine is transformed 
into the 5'-0-dimethoxytritylated nucleoside-3'-0-(S-cyano- 
ethyl-N,N-diisopropylphosphoamidite and incorporated into 
automated oligonucleotide synthesis by standard procedures 
(H. Koster et at., Nucleic Acids Res. 12, 2261 (1984)). This 
glycine modified thymidine analogue building block for 
chemical DNA synthesis can be used to substitute one or 
more of the thymidine/uridine nucleotides in the nucleic acid 
primer sequence. The Fmoc group is removed at the end of 
the solid phase synthesis with a 20 min treatment with a 20% 
solution of piperidine in DMF at room temperature. DMF is 
removed by a washing step with acetonitrile and the oligo- 
nucleotide deprotected and purified in the standard way. 
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Synthesis of a Nucleic Acid Primer Mass-Modified at C-5 of 
the Heterocyclic Base of a Pyrimidine Nucleoside with 
p-Alanine Residues 

Starting material was the same as in EXAMPLE 6. 0.281 
g (1.0 mmole) 5-(3-Aminopropynyl-l)-2-deoxyuridine was 
reacted with N-Fmoc-p-alanine pentafluorophenylester 
(0.955 g, 2.0 mmole) in 5 ml N,N-dimethylformamide 
(DMF) in the presence of 0.129 g (174 ul; 1.0 mmole) 1Q 
N,N-disopropylethylamine for 60 min at room temperature. 
Solvents were removed and the product purified by silica gel 
chromatography as described in EXAMPLE 6. Yield was 
0.425 g (0.74 mmole, 74%). Another p-alanine moiety can 
be added in exactly the same way after removal of the Fmoc 15 
group. The preparation of the 5'-0-dimethoxytritylated 
nucleoside-B'-O-p-cyanoethyl-N^-diisopropylphosphoam- 
idite from 5-(3-(N-Fmoc-p-alanyl)-amidopropynyl-l)-2'- 
deoxyuridine and incorporation into automated oligonucle- 
otide synthesis is performed under standard conditions. This 2Q 
building block can substitute for any of the thymidine/ 
uridine residues in the nucleic acid primer sequence. In the 
case of only one incorporated mass-modified nucleotide, the 
nucleic acid primer molecules prepared according to 
EXAMPLES 6 and 7 would have a mass difference of 14 25 
daltons. 


EXAMPLE 8 

Synthesis of a Nucleic Acid Primer Mass-Modified at C-5 of 30 
the Heterocyclic Base of a Pyrimidine Nucleoside with 
Ethylene Glycol Monomethyl Ether 

As a nucleosidic component, 5-(3-aminopropynyl-l)-2'- 
deoxyuridine was used in this example (see EXAMPLES 6 
and 7). The mass-modifying functionality was obtained as 35 
follows: 7.61 g (100.0 mmole) freshly distilled ethylene 
glycol monomethyl ether dissolved in 50 ml absolute pyri- 
dine was reacted with 10.01 g (100.0 mmole) recrystallized 
succinic anhydride in the presence of 1.22 g (10.0 mmole) 
4-N,N-dimethylaminopyridine overnight at room tempera- 40 
ture. The reaction was terminated by the addition of water 
(5.0 ml), the reaction mixture evaporated in vacuo, co- 
evaporated twice with dry toluene (20 ml each) and the 
residue redissolvedin 100 ml dichloromethane. The solution 
was extracted successively, twice with 10% aqueous citric 45 
acid (2x20 ml) and once with water (20 ml) and the organic 
phase dried over anhydrous sodium sulfate. The organic 
phase was evaporated in vacuo, the residue redissolved in 50 
ml dichloromethane and precipitated into 500 ml pentane 
and the precipitate dried in vacuo. Yield was 13.12 g (74.0 50 
mmole; 74%). 8.86 g (50.0 mmole) of succinylated ethylene 
glycol monomethyl ether was dissolved in 100 ml dioxane 
containing 5% dry pyridine (5 ml) and 6.96 g (50.0 mmole) 
4-nitrophenol and 10.32 g (50.0 mmole) dicyclohexylcar- 
bodiimide was added and the reaction run at room tempera- 55 
ture for 4 hours. Dicyclohexylurea was removed by filtra- 
tion, the filtrate evaporated in vacuo and the residue 
redissolved in 50 ml anhydrous DMF. 12.5 ml (about 12.5 
mmole 4 -nitrophenylester) of this solution was used to 
dissolve 2.81 g (10.0 mmole) 5-(3 -arninopropynyl-l)-2 r - 60 
deoxyuridine. The reaction was performed in the presence of 
1.01 g (10.0 mmole; 1.4 ml) triethylamine at room tempera- 
ture overnight. The reaction mixture was evaporated in 
vacuo, co-evaporated with toluene, redissolved in dichlo- 
romethane and chromatographed on silicagel (Si60, Merck; 65 
column 4x50 cm) with dichloromethane/methanol mixtures. 
The fractions containing the desired compound were col- 


lected, evaporated, redissolved in 25 ml dichloromethane 
and precipitated into 250 ml pentane. The dried precipitate 
of 5-(3-N-(0-succinyl ethylene glycol monomethyl ether)- 
amidopropynyl- l)-2'-deoxyuridine (yield: 65%) is 5-0- 
dimethoxytritylated and transformed into the nucleoside-3'- 
0-B-cyanoemyl-N,N-diisopropylphosphoamidite and 
incorporated as a building block in the automated oligo- 
nucleotide synthesis according to standard procedures. The 
mass-modified nucleotide can substitute for one or more of 
the thyrnidine/uridine residues in the nucleic acid primer 
sequence. Deprotection and purification of the primer oli- 
gonucleotide also follows standard procedures. 

EXAMPLE 9 

Synthesis of a Nucleic Acid Primer Mass-Modified at C-5 of 
the Heterocyclic Base of a Pyrimidine Nucleoside with 
Diethylene Glycol Monomethyl Ether 

Nucleosidic starting material was as in previous 
examples, 5-(3 -aminopropynyl-l)-2'-deoxyuridine. The 
mass-modifying functionality was obtained similar to 
EXAMPLE 8. 12.02 g (100.0 mmole) freshly distilled 
diethylene glycol monomethyl ether dissolved in 50 ml 
absolute pyridine was reacted with 10.01 g (100.0 mmole) 
recrystallized succinic anhydride in the presence of 1.22 g 
(10.0 mmole) 4-N,N-dimethylaminopyridine (DMAP) over- 
night at room temperature. The work-up was as described in 
EXAMPLE 8. Yield was 18.35 g (82.3 mmole, 82.3%). 
11.06 g (50.0 mmole) of succinylated diethylene glycol 
monomethyl ether was transformed into the 4-nitropheny- 
lester and, subsequently, 12.5 mmole was reacted with 2.81 
g (10.0 mmole) of 5-(3-aminopropynyl-l)-2'-deoxyuridine 
as described in EXAMPLE 8. Yield after silica gel column 
chromatography and precipitation into pentane was 3.34 g 
(6.9 mmole, 69%). After dimethoxytritylation and transfor- 
mation into the nucleoside- pVcyanoethylphosphoamidite, 
the mass-modified building block is incorporated into auto- 
mated chemical DNA synthesis according to standard pro- 
cedures. Within the sequence of the nucleic acid primer 
UP 0 " 1 , one or more of the thymidine/uridine residues can be 
substituted by this mass-modified nucleotide. In the case of 
only one incorporated mass-modified nucleotide, the nucleic 
acid primers of EXAMPLES 8 and 9 would have a mass 
difference of 44.05 daltons. 

EXAMPLE 10 

Synthesis of a Nucleic Acid Primer Mass-Modified at C-8 of 
the Heterocyclic Base of Deoxyadenosine with Glycine 

Starting material was N 6 -benzoyl-8-bromo-5'-0-(4,4- 
dimethoxytrityl)-2'-deoxyadenosine prepared according to 
literature (Singh et al., Nucleic Acids Res. 18, 3339-45 
(1990)). 632.5 mg (1.0 mmole) of this 8-bromo-deoxyad- 
enosine derivative was suspended in 5 ml absolute ethanol 
and reacted with 251 .2 mg (2.0 mmole) glycine methyl ester 
(hydrochloride) in the presence of 241.4 mg (2.1 mmole; 
366 ul) N,N-diisopropylethylamine and refluxed until the 
starting nucleosidic material had disappeared (4-6 hours) as 
checked by thin layer chromatography (TLC). The solvent 
was evaporated and the residue purified by silica gel chro- 
matography (column 2.5x50 cm) using solvent mixtures of 
chloroform/methanol containing 0.1% pyridine. The product 
fractions were combined, the solvent evaporated, the frac- 
tions dissolved in 5 ml dichloromethane and precipitated 
into 100 ml pentane. Yield was 487 mg (0.76 mmole, 76%). 
Transformation into the corresponding nucleoside- p-cyano- 
ethylphosphoamidite and integration into automated chemi- 
cal DNA synthesis is performed under standard conditions. 
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During final deprotection with aqueous concentrated ammo- 
nia, the methyl group is removed from the glycine moiety. 
The mass-modified building block can substitute one or 
more deoxyadenosine/adenosine residues in the nucleic acid 
primer sequence. 5 

EXAMPLE 11 

Synthesis of a Nucleic Acid Primer Mass-Modified at C-8 of 
the Heterocyclic Base of Deoxyadenosine with Glycylgly- 10 
cine 

This derivative was prepared in analogy to the glycine 
derivative of EXAMPLE 10. 632.5 mg (1.0 mmole) 
N 6 -Benzoyl-8-bromo-5 , -0-(4,4'-dimethoxytrityl)- 2'-deoxy- 
adenosine was suspended in 5 ml absolute ethanol and 15 
reacted with 324.3 mg (2.0 mmole) glycyl-glycine methyl 
ester in the presence of 241.4 mg (2.1 mmole, 366 ul) 
N,N-diisopropylethylamine. The mixture was refluxed and 
completeness of the reaction checked by TLC. Work-up and 
purification was similar to that described in EXAMPLE 10. 20 
Yield after silica gel column chromatography and precipi- 
tation into pentane was 464 mg (0.65 mmole, 65%). Trans- 
formation into the nucleoside-p-cyanoethylphosphoamidite 
and into synthetic oligonucleotides is done according to 
standard procedures. In the case where only one of the 25 
deoxyadenosine/adenosine residues in the nucleic acid 
primer is substituted by this mass-modified nucleotide, the 
mass difference between the nucleic acid primers of 
EXAMPLES 10 and 11 is 57.03 daltons. 

30 

EXAMPLE 12 

Synthesis of a Nucleic Acid Primer Mass-Modified at the 
C-2" of the Sugar Moiety of 2-amino-2'-Deoxythymidine 
with Ethylene Glycol Monomethyl Ether Residues 35 

Starting material was 5 l -0-(4,4-dimethoxytrityl)-2'- 
amino-2'-deoxythymidine synthesized according to pub- 
lished procedures (e.g., Verheyden et at., J. Org. Chem. 36, 
250-254 (1971); Sasaki et al., J. Org. Chem. 41, 3138-3143 
(1976); Imazawa et al., J. Org. Chem. 44, 2039-2041 40 
(1979); Hobbs et al., J. Org. Chem. 42, 714-719 (1976); 
Ikehara et al., Chem. Pharm. Bull. Japan 26, 240-244 
(1978); see also PCT Application WO 88/00201). 5'-0-(4, 
4-Dimethoxytrityl)-2 , -amino-2 , -deoxythymidine (559.62 
mg; 1 .0 mmole) was reacted with 2.0 mmole of the 4-ni- 45 
trophenyl ester of succinylated ethylene glycol monomethyl 
ether (see EXAMPLE 8) in 10 ml dry DMF in the presence 
of 1.0 mmole (140 ul) triethylamine for 18 hours at room 
temperature. The reaction mixture was evaporated in vacuo, 
co-evaporated with toluene, redissolved in dichloromethane 50 
and purified by silica gel chromatography (Si60, Merck; 
column: 2.5x50 cm; eluent: chloroform/methanol mixtures 
containing 0.1% triethylamine). The product containing 
fractions were combined, evaporated and precipitated into 
pentane. Yield was 524 mg (0.73 mmol; 73%). Transforma- 55 
tion into the nucleoside-pVcyanoethyl-N,N-diisopropylphos- 
phoamidite and incorporation into the automated chemical 
DNA synthesis protocol is performed by standard proce- 
dures. The mass-modified deoxythymidine derivative can 
substitute For one or more of the thymidine residues in the 60 
nucleic acid primer. 

In an analogous way, by employing the 4-nitrophenyl 
ester of succinylated diethylene glycol monomethyl ether 
(see EXAMPLE 9) and triethylene glycol monomethyl 
ether, the corresponding mass-modified oligonucleotides are 65 
prepared. In the case of only one incorporated mass-modi- 
fied nucleoside within the sequence, the mass difference 


between the ethylene, diethylene and triethylene glycol 
derivatives is 44.05, 88.1 and 132.15 daltons respectively. 

EXAMPLE 13 

Synthesis of a Nucleic Acid Primer Mass-Modified in the 
Internucleotidic Linkage Via Alkylation of Phosphorothio- 
ate Groups 

Phosphorothioate-containing oligonucleotides were pre- 
pared according to standard procedures (see e.g. Gait et al., 
Nucleic Acids Res., 19 1183 (1991)). One, several or all 
internucleotide linkages can be modified in this way. The 
(-)-M13 nucleic acid primer sequence (17-mer) 5'-dG- 
TAAAACGACGGCCAGT was synthesized in 0.25 umole 
scale on a DNA synthesizer and one phosphorothioate group 
introduced after the final synthesis cycle (G to T coupling). 
Sulfurization, deprotection and purification followed stan- 
dard protocols. Yield was 31.4 nmole (12.6% overall yield), 
corresponding to 31.4 nmole phosphorothioate groups. 
Alkylation was performed by dissolving the residue in 31.4 
ul TE buffer (0.01 M Tris pH 8.0, 0.001 M EDTA) and by 
adding 16 ul of a solution of 20 mM solution of 2-iodoet- 
hanol (320 nmole; i.e., 10-fold excess with respect to 
phosphorothioate diesters) in N,N-dimethylformamide 
(DMF). The alkylated oligonucleotide was purified by stan- 
dard reversed phase HPLC (RP-18 Ultraphere, Beckman; 
column: 4.5x250 mm; 100 mM tricthyl ammonium acetate, 
pH 7.0 and a gradient of 5 to 40% acetonitrile). 

In a variation of this procedure, the nucleic acid primer 
containing one or more phosphorothioate phosphodiester 
bond is used in the Sanger sequencing reactions. The primer- 
extension products of the four sequencing reactions are 
purified as exemplified in EXAMPLES 1-4, cleaved off the 
solid support, lyophilized and dissolved in 4 ul each of TE 
buffer pH 8.0 and alkylated by addition of 2 jil of a 20 mM 
solution of 2-iodoethanol in DMF. It is then analyzed by ES 
and/or MALDI mass spectrometry. 

In an analogous way, employing instead of 2-iodoethanol, 
e.g., 3-iodopropanol, 4-iodobutanol mass-modified nucleic 
acid primer are obtained with a mass difference of 14.03, 
28.06 and 42.03 daltons respectively compared to the 
unmodified phosphorothioate phosphodiester-containing 
oligonucleotide. 

EXAMPLE 14 

Synthesis of 2'-amino-2'-deoxyuridine-5 l -triphosphate and 
3 , -amino-2 , ,3' -dideoxythymidine-5'-triphosphate Mass- 
Modified at the T- or 3 -amino Function with Glycine or 
B-Alanine Residues 

Starting material was 2'-azido-2'-deoxyuridine prepared 
according to literature (Verheyden et al., J. Org. Chem. 36, 
250 (1971)), which was 4,4-dimethoxytritylated at 5-OH 
with 4,4-dimethoxytrityl chloride in pyridine and acetylated 
at 3'-OH with acetic anhydride in a one-pot reaction using 
standard reaction conditions. With 191 mg (0.71 mmole) 
2'-azido-2'-deoxyuridine as starting material, 396 mg (0.65 
mmol, 90.8%) 5 , -0-(4,4-dimethoxytrityl)-3'-0-acetyl-2'- 
azido-2'-deoxuridine was obtained after purification via 
silica gel chromatography. Reduction of the azido group was 
performed using published conditions (Barta et al., Tetra- 
hedron 46, 587-594 (1990)). Yield of 5 , -0-(4,4-dimethox- 
ytrityl)-3'-0-acetyl-2'-amino-2 r -deoxyuridine after silica gel 
chromatography was 288 mg (0.49 mmole; 76%). This 
protected 2'-amino-2 , -deoxyuridine derivative (588 mg, 1.0 
mmole) was reacted with 2 equivalents (927 mg, 2.0 mmole) 
N-Fmoc-glycine pentafluorophenyl ester in 10 ml dry DMF 
overnight at room temperature in the presence of 1.0 mmole 
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(174 N,N-diisopropylethylamine. Solvents were 

removed by evaporation in vacuo and the residue purified by 
silica gel chromatography. Yield was 711 mg (0.71 mmole, 
82%). Detritylation was achieved by a one hour treatment 
with 80% aqueous acetic acid at room temperature. The s 
residue was evaporated to dryness, co-evaporated twice with 
toluene, suspended in 1 ml dry acetonitrile and 5'-phospho- 
rylated with POCl 3 according to literature (Yoshikawa et al., 
Bull. Chem. Soc. Japan 42, 3505 (1969) and Sowa et al., 
Bull. Chem. Soc. Japan 48, 2084 (1975)) and directly 10 
transformed in a one-pot reaction to the 5 '-triphosphate 
using 3 ml of a 0.5M solution (1.5 mmole) tetra (tri-n- 
butylammonium) pyrophosphate in DMF according to lit- 
erature (e.g. Seela et al., Helvetica Chimica Acta 74, 1048 
( 199 1 )). The Fmoc and the 3 '-O-acetyl groups were removed 15 
by a one-hour treatment with concentrated aqueous ammo- 
nia at Toom temperature and the reaction mixture evaporated 
and lyophilized. Purification also followed standard proce- 
dures by using anion-exchange chromatography on DEAE- 
Sephadex with a linear gradient of triethylammonium bicar- 20 
bonate (0.1M-1.0M). Triphosphate containing fractions 
(checked by thin layer chromatography on polyethylene- 
imine cellulose plates) were collected, evaporated and lyo- 
philized. Yield (by UV-absorbance of the uracil moiety) was 
68% (0.48 mmole). 25 

A glycyl-glycine modified 2 , -amino-2'-deoxyuridine-5'- 
triphosphate was obtained by removing the Fmoc group 
from 5-0-(4,4-dimethoxytrityI)-3 , -0-acetyl-2'-N-(N-9- 
fluorenylmethyloxycarbonyl-glycyl)-2'-amino-2'-deoxyuri- 
dine by a one-hour treatment with a 20% solution of 30 
piperidine in DMF at room temperature, evaporation of 
solvents, two-fold co-evaporation with toluene and subse- 
quent condensation with N-Fmoc-glycine pentafluorophenyl 
ester. Starting with 1.0 mmole of the 2'-N-glycyl-2'-amino- 
2 , -deoxyuridine derivative and following the procedure 35 
described above, 0.72 mmole (72%) of the corresponding 
2 , -(N-glycyl-glycyl)-2 , -amino-2'-deoxyuridine-5'-triphos- 
phate was obtained. 

Starting with 5'-0-(4,4-dimethoxytrityl)-3'-0-acetyl-2'- 
amino-2'-deoxyuridine and coupling with N-Fmoc-Pg-ala- 40 
nine pentafluorophenyl ester, the corresponding 2'-(N-p- 
alanyl>2'-amino-2 , -deoxyuridine-5 '-triphosphate can be 
synthesized. These modified nucleoside triphosphates are 
incorporated during the Sanger DNA sequencing process in 
the primer-extension products. The mass difference between 45 
the glycine, P-alanine and glycyl-glycine mass-modified 
nucleosides is, per nucleotide incorporated, 58.06, 72.09 and 
115.1 daltons respectively. 

When starting with 5'-0-(4,4-dimethoxytrityl)-3'-arnino- 
2',3'-dideoxythymidine (obtained by published procedures, so 
see EXAMPLE 12), the corresponding 3'-(N-glycyl)-3'- 
amino-/3'-(-N-glycyl-glycyl)-3'-arriirio-/and 3 '-(N-p-alanyl)- 
3'-amino-2^3'-dideoxythymidine-5'-triphosphates can be 
obtained. These mass-modified nucleoside triphosphates 
serve as a terminating nucleotide unit in the Sanger DNA 55 
sequencing reactions providing a mass difference per termi- 
nated fragment of 58.06, 72.09 and 115.1 daltons respec- 
tively when used in the multiplexing sequencing mode. The 
mass-differentiated fragments can then be analyzed by ES 
and/or MALDI mass spectrometry. 60 

EXAMPLE 15 

Synthesis of Deoxyuridine-5'-triphosphate Mass-Modified 
at C-5 of the Heterocyclic Base with Glycine, Glycyl- 
Glycine and ^-Alanine Residues 65 

0.281 g (1.0 mmole) 5-(3-Aminopropynyl-l)-2'-deox- 
yuridine (see EXAMPLE 6) was reacted with either 0.927 g 


(2.0 mmole) N-Fmoc-glycine pentafluorophenyl ester or 
0.955 g (2.0 mmole) N-Fmoc-P-alanine pentafluorophenyl 
ester in 5 ml dry DMF in the presence of 0.129 g N,N- 
diisopropylethylamine (174 ul, 1.0 mmole) overnight at 
room temperature. Solvents were removed by evaporation in 
vacuo and the condensation products purified by flash chro- 
matography on silica gel (Still et aL, J. Org. Chem. 43, 
2923-2925 (1978)). Yields were 476 mg (0.85 mmole: 85%) 
for the glycine and 436 mg (0.76 mmole; 76%) for the 
p-alanine derivatives. For the synthesis of the glycyl-glycine 
derivative, the Fmoc group of 1.0 mmole Fmoc-glycine- 
deoxyuridine derivative was removed by one-hour treatment 
with 20% piperidine in DMF at room temperature. Solvents 
were removed by evaporation in vacuo, the residue was 
co-evaporated twice with toluene and condensed with 0.927 
g (2.0 mmole) N-Fmoc-glycine pentafluorophenyl ester and 
purified as described above. Yield was 445 mg (0.72 mmole; 
72%). The glycyl-, glycyl-glycyl- and p-alanyl-2'-deoxyuri- 
dine derivatives, N-protected with the Fmoc group were 
transformed to the 3'-0~acetyl derivatives by tritylation with 
4,4-dimethoxytrityl chloride in pyridine and acetylation with 
acetic anhydride in pyridine in a one-pot reaction and 
subsequently detritylated by one hour treatment with 80% 
aqueous acetic acid according to standard procedures. Sol- 
vents were removed, the residues dissolved in 100 ml 
chloroform and extracted twice with 50 ml 10% sodium 
bicarbonate and once with 50 ml water, dried with sodium 
sulfate, the solvent evaporated and the residues purified by 
flash chromatography on silica gel. Yields were 361 mg 
(0.60 mmole; 71%) for the glycyl-, 351 mg (0.57 mmole; 
75%) for the p-alanyl- and 323 mg (0.49 mmole; 68%) for 
the glycyl-glycyl-3-0'-acetyl-2'-deoxyuridine derivatives 
respectively. Phosphorylation at the 5'-OH with POCl 3 , 
transformation into the 5 '-triphosphate by in-situ reaction 
with tetra(tri-n-butylammonium) pyrophosphate in DMF, 
3'de-O-acetylation, cleavage of the Fmoc group, and final 
purification by anion-exchange chromatography on DEAE- 
Sephadex was performed as described in EXAMPLE 14. 
Yields according to UV-absorbance of the uracil moiety 
were 0.41 mmole 5-(3-(N-glycyl)-amidopropynyl- l)-2'- 
deoxyuridine-S'-triphosphate (84%), 0.43 mmole 5-(3-(N-P- 
alanyl)~amidopropynyl- 1 )-2'-deoxyuridine-5 , -triphosphate 
(75%) and 0.38 mmole 5-(3 -(N-glycyl-glycyl)-amidopro- 
pynyl- l)-2'-deoxyiiridine-5'-triphosphate (78%). 

These mass-modified nucleoside triphosphates were 
incorporated during the Sanger DNA sequencing primer- 
extension reactions. 

When using 5-(3-arru^opropynyl-l)~2 , ,3'-dideoxyuridine 
as starting material and following an analogous reaction 
sequence the corresponding glycyl-, glycyl-glycyl- and 
p-alanyl-2\3 , -dideoxyuridine-5'-triphosphates were 
obtained in yields of 69, 63 and 71% respectively. These 
mass-modified nucleoside triphosphates serve as chain-ter- 
minating nucleotides during the Sanger DNA sequencing 
reactions. The mass-modified sequencing ladders are ana- 
lyzed by either ES or MALDI mass spectrometry. 

EXAMPLE 16 

Synthesis of 8-gIycyl- and 8-glycyl-glycyl-2'-deoxyadenos- 
ine-5'-triphosphate 

727 mg (1.0 mmole) of N <s -(4-tert-butylphenoxyaceryl)- 
8-glycyl-5'-(4,4 -dimethoxytrityl)-2'-deoxyadenosine or 800 
mg (1.0 mmole) N 6 -(4-tert-butylphenoxyacetyl)- 8-glycyl- 
glycyl-5 , -(4,4-dimethoxytrityl)-2'-deoxyadenosine prepared 
according to EXAMPLES 10 and 11 and literature (K 
oster et ai., Tetrahedron 37, 362 (1981)) were acetylated 
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with acetic anhydride in pyridine at the 3"-OH, detritylated 
at the 5'-position with 80% acetic acid in a one-pot reaction 
and transformed into the 5'-triphosphates via phosphoryla- 
tion with POCl 3 and reaction in-situ with tetra(tri-n-buty- 
1 ammonium) pyrophosphate as described in EXAMPLE 14. 5 
Deprotection of the N 6 -tert-butylphenoxyacetyl, the 3'~0- 
acetyl and the O-methyl group at the glycine residues was 
achieved with concentrated aqueous ammonia for ninety 
minutes at room temperature. Ammonia was removed by 
lyophilization and the residue washed with dichlo- 10 
romethane, solvent removed by evaporation in vacuo and 
the remaining solid material purified by anion-exchange 
chromatography on DEAE-Sephadex using a linear gradient 
of triethylammonium bicarbonate from 0.1 to 1.0M. The 
nucleoside triphosphate containing fractions (checked by is 
TLC on polyethyleneimine cellulose plates) were combined 
and lyophillized. Yield of the 8-glycyl-2'-deoxyadenosine- 
5-triphosphate (determined by UV-absorbance of the 
adenine moiety) was 57% (0.57 mmole). The yield for the 
8-glycyl-glycyl-2'-deoxyadenosine-5'-triphosphate was 20 
51% (0.51 mmole). 

These mass-modified nucleoside triphosphates were 
incorporated during primer-extension in the Sanger DNA 
sequencing reactions. 

When using the corresponding N 6 -(4-tert-butylphenoxy- 25 
acetyl)-8-glycyl- or -glycyl-glycyl-5'-0-(4,4-dimethoxytri- 
tyl)-2',3'-dideoxyadenosine derivatives as starting materials 
prepared according to standard procedures (see, e.g., for the 
introduction of the 2 , 3 -function: Seela et al., Helvetica 
Chimica Acta 74, 1048-1058 (1991)) and using an analo- 30 
gous reaction sequence as described above, the chain-ter- 
minating mass-modified nucleoside triphosphates 8-glycyl- 
and 8-glycyl-glycyl-2\3'-dideoxyadenosine-5'-triphosphates 
were obtained in 53 and 47% yields respectively. The 
mass-modified sequencing fragment ladders are analyzed by 35 
either ES or MALDI mass spectrometry. 

EXAMPLE 17 

Mass-Modification of Sanger DNA Sequencing Fragment 40 
Ladders by Incorporation of Chain-Elongating 2'-deoxy- 
and Chain-Terminating 2 , ,3'-dideoxythymidine-5'-(alpha-S- 
)-triphosphate and Subsequent Alkylation with 2-iodoetha- 
nol and 3-iodopropanol 

2',3 , -Dideoxythymidine-5 -(alpha-S)-triphosphate was 45 
prepared according to published procedures (e.g., for the 
alpha-S -triphosphate moiety: Eckstein et al., Biochemistry 
15, 1685 (1976) and Accounts Chem. Res, 12, 204 (1978) 
and for the 2',3-dideoxy moiety: Seela et al., Helvetica 
Chimica Acta, 74, 1048-1058 (1991)). Sanger DNA 50 
sequencing reactions employing 2 , -deoxythymidine~5'-(al- 
pha-S) -triphosphate are performed according to standard 
protocols (e.g. Eckstein, Ann, Rev, Biochem, 54, 367 
(1985)). When using 2 , ,3'-dideoxythymidine-5'-(alpha-S)- 
triphosphates, this is used instead of the unmodified 2\3'- 55 
dideoxythymidine-5'-triphosphate in standard Sanger DNA 
sequencing (see e.g. Swerdlow et al., Nucleic Acids Res, 18, 
1415-1419 (1990)). The template (2 pmole) and the nucleic 
acid M13 sequencing primer (4 pmole) modified according 
to EXAMPLE 1 are annealed by heating to 65° C. in 1 00 ul 60 
of 10 mM Tris-HCl pH 7.5, 10 mM MgCl 2 , 50 mM NaCl, 
7 mM dithiothreitol (DTT) for 5 min and slowly brought to 
37° C. during a one hour period. The sequencing reaction 
mixtures contain, as exemplified for the T-specific termina- 
tion reaction, in a final volume of 150 ul, 200 uM (final 65 
concentration) each of dATP, dCTP, dTTP, 300 uM 
c7-deaza-dGTP, 5 uM 2 , ,3 , -dideoxythymidine-5 , -(alpha-S)- 
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triphosphate and 40 units Sequenase (United States Bio- 
chemicals). Polymerization is performed for 10 min at 37° 
C, the reaction mixture heated to 70° C. to inactivate the 
Sequenase, ethanol precipitated and coupled to thiolated 
Sequelon membrane disks (8 mm diameter) as described in 
EXAMPLE 1. Alkylation is performed by treating the disks 
with 10 ul of 10 mM solution of either 2-iodoethanol or 
3-iodopropanol in NMM (N-methyImorpholine/water/2- 
propanol, 2/49/49, v/v/v) (three times), washing with 10 ul 
NMM (three times) and cleaving the alkylated T-terminated 
primer-extension products off the support by treatment with 
DTT as described in EXAMPLE 1. Analysis of the mass- 
modified fragment families is performed with either ES or 
MALDI mass spectrometry. 

EXAMPLE 18 

Analysis of a Mixture of Oligothymidylic Acids 

Oligothymidylic acid, oligo p(dT) 12 . lg , is commercially 
available (United States Biochemical, Cleveland, Ohio). 
Generally, a matrix solution of 0.5M in ethanol was pre- 
pared. Various matrices were used for this Example and 
Examples 19-21 such as 3,5-dihydroxybenzoic acid, sina- 
pinic acid, 3-hydroxypicolinic acid, 2,4,6-trihydroxyac- 
etophenone. Oligonucleotides were lyophilized after purifi- 
cation by HPLC and taken up in ultrapure water (MilliQ, 
Millipore) using amounts to obtain a concentration of 10 
pmoles/ul as stock solution. An aliquot (1 pi) of this con- 
centration or a dilution in ultrapure water was mixed with 1 
ul of the matrix solution on a flat metal surface serving as the 
probe tip and dried with a fan using cold air. In some 
experiments, cation-ion exchange beads in the acid form 
were added to the mixture of matrix and sample solution. 

MALDI-TOF spectra were obtained for this Example and 
Examples 19-21 on different commercial instruments such 
as Vision 2000 (Finnigan-MAT), VG TofSpec (Fisons 
Instruments), LaserTec Research (Vestec). The conditions 
for this Example were linear negative ion mode with an 
acceleration voltage of 25 kV. The MALDI-TOF spectrum 
generated is shown in FIG. 14. Mass calibration was done 
externally and generally achieved by using defined peptides 
of appropriate mass range such as insulin, gramicidin S, 
trypsinogen, bovine serum albumen, and cytochrome C. All 
spectra were generated by employing a nitrogen laser with 
5 nsec pulses at a wavelength of 337 nm. Laser energy 
varied between 10 6 and 10 7 W/cm 2 . To improve signal-to- 
noise ratio generally, the intensities of 10 to 30 laser shots 
were accumulated. 

EXAMPLE 19 

Mass Spectrometric Analysis of a 50-mer and a 99-mer 

Two large oligonucleotides were analyzed by mass spec- 
trometry. The 50-mer d (TAACGGTCATTACGGCCAT- 
TGACTGTAGGACCTGCATTACATGACTAGCT) (SEQ 
ID NO:3) and dT(pdT) 99 were used. The oligodeoxynucle- 
otides were synthesized using P -cyanoethylphosphoamid- 
ites and purified using published procedures. (e.g. N. D. 
Sinha, J. Bieraat, J. McManus and H. Koster, Nucleic Acids 
Res., 12, 4539 (1984)) employing commercially available 
DNA synthesizers from either Millipore (Bedford, Mass.) or 
Applied Biosystems (Foster City, Calif.) and HPLC equip- 
ment and RP 18 reverse phase columns from Waters (Mil- 
ford, Mass.). The samples for mass spectrometric analysis 
were prepared as described in Example 18. The conditions 
used for MALDI-MS analysis of each oligonucleotide were 
500 fmol of each oligonucleotide, rellectron positive ion 
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mode with an acceleration of 5 kV and postacceleration of 
20 kV. The MALDI-TOF spectra generated were superim- 
posed and are shown in FIG. IS. 

EXAMPLE 20 

Simulation of the DNA Sequencing Results of FIG. 2 

The 13 DNA sequences representing the nested dT-ter- 
minated fragments of the Sanger DNA sequencing for the 
50-mer described in Example 19 (SEQ ID NO:3) were 
synthesized as described in Example 19. The samples were 
treated and 500 fmol of each fragment was analyzed by 
MALDI-MS as described in Example 18. The resulting 
MALDI-TOF spectra are shown in FIG. lfi. The conditions 
were refiectron positive ion mode with an acceleration of 5 
kV and postacceleration of 20 kV. Calculated molecular 
masses and experimental molecular masses are shown in 
Table 1. 

The MALDI-TOF spectra were superimposed (FIG. 17) 
to demonstrate that the individual peaks are resolvable even 
between the 10-mer and 11-mer (upper panel) and the 
37-mer and 38-mer (lower panel). The two panels show two 
different scales and the spectra analyzed at that scale. 

EXAMPLE 21 

MALDI-MS Analysis of a Mass-Modified Oligonucleotide 
A 17-mer was mass-modified at C-S of one or two 
deoxyuridine moieties. 5-[13 -(2-MethoxyethoxyI)-tride- 
cy ne- 1 -yll-S'-O-C^'-dimethoxytrity^-T-deoxyuridine-S'- 
P^cyanoethyl-N,N-diisopropylphosphoamidite was used to 


synthesize the modified 17-mers using the methods 
described in Example 19. 
The modified 17-mers were 

x 
I 

a: d (TAAAACGACGGCCAGUG) (molecular mass: 5454) 
(SEQIDNO:4) 


b: d (UAAAACGACGGCCAGUG) (molecular mass 5634) 
(SEQIDNO:5) 


where X=— teC— (CH 2 ) 11 — OH 
(unmodified 17-mer: molecular mass: 5273) 
The samples were prepared and 500 fmol of each modi- 
fied 17-mer was analyzed using MALDI-MS as described in 
Example 18. The conditions used were refiectron positive 
ion mode with an acceleration of 5 kV and postacceleration 
of 20 kV. The MALDI-TOF spectra which were generated 
were superimposed and are shown in FIG. 18. 

All of the above-cited references and publications are 
hereby incorporated by reference. 

EQUIVALENTS 

Those skilled in the art will recognize, or be able to 
ascertain using no more than routine experimentation, 
numerous equivalents to the specific procedures described 
herein. Such equivalents are considered to be within the 
scope of this invention and are covered by the following 
claims. 


SEQUENCE LISTING 


( 1 ) GENERAL INFORMATION: 

(Mi ) NUMBER OF SEQUENCES: 5 

( 2 ) INFORMATION FOR SEQ ID NO:l: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 14 base pairs 
( B ) TYPE: nucleic arid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: oiicr nucleic acid 

( i i i ) HYPOTHETICAL: YES 

( x i) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CATGCCATGG CATG 


( 2 ) INFORMATION FOR SEQ ID NO:2: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 21 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: other nucleic acid 

( i i i ) HYPOTHETICAL: YES 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:2: 


AAATTGTGCA CATCCTGCAG C 
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( 2 ) INFORMATION FOR SEQ ID NO:3: 

( i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 50 base pairs 
( R ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: olhcr nucleic acid 

( i i i ) HYPOTHETICAL: YES 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

TAACGGTCAT TACGGCCATT GACTGTAGGA CCTGCATTAC ATGACTAGCT 50 


( 2 ) INFORMATION FOR SEQ ID NO:4: 

( i ] SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: IB base pairs 
( B ) TYPE; nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: other nucleic acid 

( i i i ) HYPOTHETICAL: YES 

(xt) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

TAAAACGACG GGCCAGNG 


C 2 ) INFORMATION FOR SEQ ID NO:5: 

( i ) SEQUENCE CHARACTERISTICS: 
C A ) LENGTH: 18 base pairs 
( B J TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
C D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: other nucleic acid 

( i i i ) HYPOTHETICAL; YES 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:5: 


I claim: 

1. A method for determining the sequence of a nucleic 45 
acid, comprising the steps of: 

a) generating at least two conditioned, base-specifically 
terminated nucleic acid fragments from a nucleic acid 
to be sequenced; 

b) determining the molecular weight value of each base- 
specifically terminated fragment by mass spectrometry, 
wherein the molecular weight values of at least two 
base- specifically terminated fragments are determined 
concurrently; and 55 

c) determining the sequence of the nucleic acid by align- 
ing the base-specifically terminated nucleic acid frag- 
ments according to molecular weight. 

2. The method according to claim 1 wherein the nucleic 
acid fragments are purified before the step of determining 60 
the molecular weight values by mass spectrometry. 

3. The method according to claim 2 wherein the nucleic 
acid fragments are purified, comprising the steps of: 

i) reversibly immobilizing the nucleic acid fragments on 65 
a solid support; and 

ii) washing out all remaining reactants and by-products. 


4. The method according to claim 3, further comprising 
the step of removing the nucleic acid fragments from the 
solid support. 

5. The method according to claim 3, wherein each frag- 
ment is coupled by the linking group (L) to a functionality 
(L') on the support creating a temporary and cleavable 
attachment of the nucleic acid fragments to the support. 

6. The method according to claim 5, wherein the base- 
specifically terminated nucleic acid fragments are cleaved 
from the solid support prior to mass spectrometry. 

7. The method according to claim 5, wherein the base- 
specifically terminated nucleic acid fragments are cleaved 
from the solid support during mass spectrometry. 

8. The method according to claim 5, wherein the tempo- 
rary and cleavable attachment can be cleaved enzymatically, 
chemically or physically. 

9. The method according to claim 8, wherein the tempo- 
rary and cleavable attachment is selected from the group 
consisting of a photocleavable bond, a bond based on strong 
electrostatic interaction, a tritylether bond, a p-benzoylpro- 
pionyl group, a levulinyl group, a disulfide bond, an argin- 
ine/arginme bond, a lysine/lysine bond, a pyrophosphate 
bond, and a bond created by Watson-Crick base pairing. 

10. The method according to claim 1, wherein step a), the 
nucleic acid fragments are conditioned by cation exchange. 
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11. The method according to claim 1, wherein step a), the 
nucleic acid fragments are conditioned by mass modifica- 
tion. 

12. The method according to claim 11, wherein each 
nucleic acid fragment is synthesized using a nucleic acid 5 
primer; in the presence of chain-terminating and chain- 
elongating deoxy nucleotides; and wherein at least one 
chain-elongating deoxynucleotide is selected from the group 
consisting of deoxy adenosine triphosphate dATP, deoxythy- 
midine triphosphate dTTP, deoxyguanosine triphosphate 10 
dGTP, deoxycytidine triphosphate dCTP, deoxyinosine 
triphosphate dITP, a 7-deazadeoxyguanosine triphosphate 
c 7 dGTP, a 7-deazadeoxyadenosine triphosphate c 7 dATP, and 

a 7-deazadeoxyinosine triphosphate c 7 dITP; at least one 
chain-terminating dideoxynucleotide selected from the 15 
group consisting of dideoxyadenosine triphosphate ddATP, 
dideoxythymidine triphosphate ddTTP, dideoxyguanosine 
triphosphate ddGTP, and dideoxycytidine triphosphate 
ddCTP; and a DNA polymerase. 

13. The method according to claim 12, wherein the 20 
nucleic acid primer further includes a linking group (L) for 
reversibly immobilizing the primer on a solid support. 

14. The method according to claim 11, wherein each 
nucleic acid fragment is synthesized using chain terminating 
and chain elongating nucleotides and wherein at least one 25 
chain elongating nucleotide is selected from the group 
consisting of adenosine triphosphate (ATP), uridine triph- 
osphate (UTP), guanosine triphosphate (GTP), cytidine 
triphosphate (OTP), inosine triphosphate (FTP), a 7-deazaad- 
enosine triphosphate (c 7 ATP), a 7-deazaguanosine triphos- 30 
phate (c 7 GTP), and a 7-deazainosine triphosphate (c 7 ITP); 
and at least one chain-terminating 3'-deoxynucleotide 
selected from the group consisting of deoxyadenosine triph- 
osphate 3'-dATP, deoxyuridine triphosphate 3'-dUTP, deox- 
yguanosine triphosphate 3'-dGTP, and deoxycytidine triph- 35 
osphate 3'-dCTP); and an RNA polymerase. 

15. The method according to claim 1, wherein the molecu- 
lar weight value of each nucleic acid fragment is determined 
by matrix-assisted laser desorption/ionization mass spec- 
trometry (MALDI-MS). 4Q 

16. The method according to claim 1 in which the 
molecular weight value of each nucleic acid fragment is 
determined by electrospray mass spectrometry (ES-MS). 

17. The method of claim 1, wherein step b) is performed 
without first performing an electrophoretic separation. 45 

18. A method of claim 1, wherein the base-specifically 
terminated nucleic acid fragments are conditioned by 
removing the negative charge from the phosphodiester back- 
bone. 

19. A method of claim 1, wherein the base-specifically 50 
terminated nucleic acid fragments are conditioned by puri- 
fication. 

20. The method according to claim 1, wherein more than 
one species of nucleic acid are concurrently sequenced by 
multiplex mass spectrometric nucleic acid sequencing 55 
employing tag probes, nucleic acid primers, chain-elongat- 
ing nucleotides, and chain-terminating nucleotides, wherein 
one of the sets of base- specifically terminated fragments is 
unmodified and the other sets of base-specifically terminated 
fragments are mass modified, and each of the sets of 60 
base-specifically terminated nucleic acid fragments has a 
sufficient mass difference to be distinguished from the others 
by mass spectrometry. 

21. The method according to claim 20, wherein at least 
one of the sets of mass-modified base-specifically termi- 65 
nated fragments is modified with a mass-modifying func- 
tionality (M) at a heterocyclic base of at least one nucleotide. 


22. The method according to claim 21, wherein the 
heterocyclic base-modified nucleotide is selected from the 
group consisting of a cytosine nucleotide modified at C-5, a 
thymine nucleotide modified at C-5, a thymine nucleotide 
modified at the C-5 methyl group, a uracil nucleotide 
modified at C-5, an adenine nucleotide modified at C-8, a c 7 
-deazaadenine modified at C-S, a c 7 -deazaadenine modified 
at C-7, a guanine nucleotide modified at C-8, a c 7 -deaza- 
guanine modified at C-8, a c 7 -deazaguanine modified at C-7, 
a hypoxanthine modified at C-8, a c 7 -deazahypoxanthine 
modified at C-7, and a c 7 -deazahypoxanthine modified at 
C-8. 

23. The method according to claim 20, wherein at least 
one of the sets of mass-modified base-specifically termi- 
nated nucleic acid fragments is modified with a mass- 
modifying functionality (M) attached to one or more phos- 
phate moieties of the internucleotidic linkages of the 
fragments. 

24. The method according to claim 20, wherein at least 
one of the sets of mass-modified base-specifically termi- 
nated nucleic acid fragments is modified with a mass- 
modifying functionality (M) attached to one or more sugar 
moieties of nucleotides within the set of mass modified 
base-specifically terminated fragments at at least one sugar 
position selected from the group consisting of a C-2' posi- 
tion, an external C-3' position, and an external C-5' position. 

25. The method according to claim 20, wherein at least 
one of the sets of mass-modified base- specifically termi- 
nated nucleic acid fragments is modified with a mass- 
modifying functionality (M) attached to the sugar moiety of 
a 5'-tenninal nucleotide and wherein the mass-modifying 
function (M) is the linking functionality (L). 

26. The method according to claim 20, wherein a mass- 
modifying functionality (M) is attached to a set of base- 
specifically terminated nucleic acid fragments subsequent to 
generating the base-specifically terminated nucleic acid 
fragments and prior to deterarming the molecular weight 
values for the nested fragments by mass spectrometry. 

27. The method according to claim 26, wherein the 
base-specifically terminated nucleic acid fragments are gen- 
erated using at least one reagent selected from the group 
consisting of a nucleic acid primer, a chain-elongating 
nucleotide, a chain-terminating nucleotide, a tag probe 
which has been modified with a precursor of the mass- 
modifying functionality, M; and a subsequent step com- 
prises modifying the precursor of the mass-modifying func- 
tionality, M, to generate the mass-modifying functionality, 
M, prior to mass spectrometric analysis. 

28. The method according to claim 20, wherein mass 
differentiation of the tag probes is achieved by changing the 
nucleotide composition of at least one of the tag probes and 
complementary tag sequence in the species of nucleic acid. 

29. The method according to claim 20, wherein the tag 
probes are covalently bound to the corresponding comple- 
mentary tag sequence prior to mass spectrometric analysis. 

30. The method according to claim 29, wherein binding 
between the tag probes and the corresponding complemen- 
tary tag sequences is achieved photochemically via photo- 
activatable groups. 

31. A method of sequencing a nucleic acid, comprising the 
steps of: 

a) reversibly linking an oligonucleotide primer to a solid 
support; 

b) generating at least two conditioned, base-specifically 
terminated nucleic acid fragments; 

c) determining the molecular weight value of each nested 
fragment in each of the four sets of base-specifically 
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terminated fragments by matrix assisted laser desorp- 
tion/ionization mass spectrometry wherein the molecu- 
lar weight values of at least two base-specifically 
terminated fragments are determined concurrently and 
wherein the nested fragments are cleaved from the solid 5 
support by a laser during mass spectrometry; and 
d) determining the nucleotide sequence by aligning the 
base specifically terminated fragments according to 
molecular weight. 

32. The method according to claim 31, wherein the 10 
base-specifically terminated fragments are cleaved from the 
solid support prior to mass spectrometry. 

33. The method according to claim 31, wherein the 
base- specifically terminated fragments are cleaved from the 
solid support during mass spectrometry. 15 

34. The method according to claim 31, wherein the 
reversible linkage is a photocleavable bond. 

35. The method according to claim 31, wherein step b), 
the nucleic acid fragments are conditioned by cation 
exchange. 20 

36. The method according to claim 31, wherein step b), 
the nucleic acid fragments are conditioned by mass modi- 
fication. 

37. The method according to claim 31, wherein the 
base-specifically terminated fragments are conditioned by 25 
purification. 

38. The method according to claim 31, wherein the 
base-specifically terminated fragments are conditioned by 
removal of the negative charge of the phosphodiester back- 
bone. 30 

39. A method of multiplex analysis of nucleic acid 
sequences, comprising the steps of: 

a) reversibly linking a nucleic acid primer to a solid 
support; 
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b) generating at least two conditioned, base-specifically 
terminated nucleic acid fragments; 

c) determining the molecular weight value of each frag- 
ment by matrix assisted laser desorption/ionization 
mass spectrometry wherein the molecular weight val- 
ues of at least two base-specifically terminated frag- 
ments are determined concurrently and wherein the 
fragments are cleaved from the solid support by a laser 
during mass spectrometry; and 

d) determining the nucleotide sequence by aligning the 
fragments according to molecular weight; wherein at 
least one reagent selected from a group consisting of, a 
nucleic acid primer, a chain-elongating nucleotide, and 
a chain-terminating nucleotide which has been mass- 
modified; wherein each set of base- specifically termi- 
nated fragments has a sufficient mass difference from 
the other sets of base- specifically terminated fragments 
so as to be unique; and wherein the molecular weight 
values of the nested fragments of two or more sets of 
unseparated base-specifically terminated fragments are 
determined concurrently. 

40. The method according to claim 39, wherein the 
reversible linkage is a photocleavable bond. 

41. The method according to claim 39, wherein the 
base-specifically terminated fragments are cleaved from the 
solid support prior to mass spectrometry. 

42. The method according to claim 39, wherein the 
base-specifically terminated fragments are cleaved from the 
solid support during mass spectrometry. 


